1. Introduction


1.1 Improving mutation detection in the NF1 gene (Technical Objective 1)

Mutation detection in NF1 is difficult due to the gene's large size: approximately 350 kb of genomic
DNA and 11 kb of mRNA. During this period of support from DAMD17-93-J-3044, my laboratory contributed
to the final elucidation of all intron-exon boundaries in the NF1 gene (see attached publication in the
Appendix). Also under support from this grant, my laboratory helped identify multiple homologous loci
of NF1 that complicate mutation detection using genomic DNA as template, and we designed primers
and reaction conditions for PCR that allow NF1-locus-specific amplification of every NF1 exon in
genomic DNA despite the presence of these homologous loci (see attached publications in the
Appendix). The work of co-author Heidi Huntsman Breidenbach on both these projects was carried out
while she was employed as a full-time technician in my laboratory, paid entirely from
DAMD17-93-J-3044.

Although these advances in our knowledge of the NF1 locus at the genomic level have laid the
foundation for genomic DNA-based mutation detection in NF1, we nevertheless chose to focus our
mutation detection efforts on NF1 mRNA, due to the difficulties of establishing a rapid genomic
DNA-based screen of the 60 NF1 exons, and the pitfalls that allow major classes of mutation (such as
intron to intron inversions) to be missed in exon by exon screening at the genomic DNA level. We had
shown prior to the beginning of the DAMD17-93-J-3044 funding period that the frequency with which
heterozygosity can be detected at polymorphic bp 2034 in the ORF of NF1 is the same in NF1 patient
mRNA, NF1 patient genomic DNA, and control genomic DNA (testing approximately 150 NF1 patients
and 100 controls). Therefore, nearly all NF1 patients make detectable levels of mRNA from the
disease-causing allele. This finding justified a major effort to increase the speed and sensitivity of
detection of mutations in NF1 mRNA.

1.1.1 Pros and cons of the Protein Truncation Test in NF1 mutation detection

During this DAMD17-93-J-3044 funding period my laboratory developed the first protein truncation
test (PTT) for NF1, an mRNA-based assay. The general method for the PTT was first described by
Sarkar et al.; our approach for the NF1 gene incorporated some of the modifications of PTT
suggested for detecting mutations in the APC gene. We further modified the method to take advantage
of the newly available long PCR method. Our laboratory was the first to publish data from applying
PTT to mutation detection in NF1. Those results were described in detail in my 21 January 1995
interim Progress Report for this DAMD17-93-J-3044 grant. We detected mutations in 12 out of 25
unrelated NF1 patients screened. Since then, Dr. Heim’s laboratory reported mutation detection in 14
of 21 unrelated NF1 patients screened. Roche Molecular Diagnostics now offers a clinical diagnostic
assay based on Dr. Heim’s screening method that claims a 65% chance of detecting any given NF1
patient’s mutation.

In our recent report of long RT-PCR of the entire NF1 ORF (see attached publication in the
Appendix), the first report of such an achievement, we discuss the limitations of the PTT. Because the
RT-PCR products that can be effectively analyzed are limited in size to approximately 2 kb or smaller,
the 8.5 kb NF1 ORF must be amplified as a series of five or six overlapping products. Therefore,
medium to large deletions that span the region of overlap of two adjacent target segments for RT-PCR
amplification will fail to amplify. In these situations, no product will be made from the RNA of the
disease-causing allele; however, product of normal structure will still be made from the mRNA made by
the remaining normal allele, and so the results of the assay will appear entirely normal. For a great
variety of medium-sized to large deletions, the long RT-PCR assay of NF1 mRNA will be effective in
mutation detection, whereas the PTT will not. During her work on this project Jennifer Martinez was a
full-time technician paid entirely from DAMD17-93-J-3044.
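
The blind spot described above can be illustrated with a small sketch. The segment coordinates, the 25-base primer length, and the function names below are illustrative assumptions, not the actual NF1 primer design. The sketch only shows that a deletion removing a primer-binding site inside the overlap of two adjacent segments leaves neither segment amplifiable from the mutant allele, whereas a deletion internal to one segment still yields a (shortened) product.

```python
from typing import List, Tuple

# Hypothetical layout: five overlapping RT-PCR segments tiling an 8.5 kb ORF.
# Coordinates are illustrative only (1-based, inclusive).
SEGMENTS: List[Tuple[str, int, int]] = [
    ("S1", 1, 2000),
    ("S2", 1800, 3800),
    ("S3", 3600, 5600),
    ("S4", 5400, 7400),
    ("S5", 7200, 8500),
]


def overlaps(a: Tuple[int, int], b: Tuple[int, int]) -> bool:
    """True if two inclusive intervals share at least one base."""
    return a[0] <= b[1] and b[0] <= a[1]


def amplifiable_from_mutant(deletion: Tuple[int, int], primer_len: int = 25) -> List[str]:
    """Names of segments whose two primer sites both survive the deletion."""
    names = []
    for name, start, end in SEGMENTS:
        forward_site = (start, start + primer_len - 1)
        reverse_site = (end - primer_len + 1, end)
        if not (overlaps(deletion, forward_site) or overlaps(deletion, reverse_site)):
            names.append(name)
    return names


# Deletion spanning the S1/S2 overlap: the mutant allele gives no S1 or S2 product,
# so only the normal allele is seen and the assay appears normal.
print(amplifiable_from_mutant((1700, 2100)))
# Deletion internal to S2: both S2 primer sites survive, so a shortened product appears.
print(amplifiable_from_mutant((2500, 2900)))
```

In this toy layout the first deletion would be invisible to any segment-by-segment assay, while a single long RT-PCR product spanning the whole ORF would still reveal it as a shortened product.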

The PTT will also fail to detect small, in-frame deletions and insertions because the mutant
proteins that are synthesized will be negligibly altered in length and so fail to resolve from the
normal-length proteins made from the normal NF1 mRNA. PTT will also fail to detect missense mutations,
which alter amino acid sequence without altering protein length.
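
A toy translation makes these limits concrete. The 36-base coding sequence below is invented for illustration (it is not NF1 sequence); the codon table is the standard genetic code. An in-frame 3-base deletion shortens the polypeptide by one residue, a missense change leaves its length unchanged, and only the frameshift produces the marked truncation that the PTT is designed to see.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(cds: str) -> str:
    """Translate from the first base until a stop codon or the end of the sequence."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Invented example sequence (12 codons, no stop codon).
NORMAL = "ATGGCTGAACTGAAACGTTTCGACGGTTGGCATAAA"
IN_FRAME = NORMAL[:9] + NORMAL[12:]        # delete one whole codon (3 bases)
FRAMESHIFT = NORMAL[:9] + NORMAL[10:]      # delete a single base
MISSENSE = NORMAL[:7] + "T" + NORMAL[8:]   # substitute one base (GAA -> GTA)

for label, seq in [("normal", NORMAL), ("in-frame deletion", IN_FRAME),
                   ("frameshift", FRAMESHIFT), ("missense", MISSENSE)]:
    peptide = translate(seq)
    print(f"{label}: {len(peptide)} residues ({peptide})")
```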

Furthermore,  despite  various  modifications,  the  PTT  is  plagued  with  the  problem  that  multiple 
polypeptides  of  various  lengths,  all  shorter  than  the  protein  predicted  by  the  normal  ORF,  may  be 
synthesized  depending  on  the  particular  segment  being  screened,  even  when  the  template  is  prepared 
from  a  homozygous  control.  These  “extra  bands,”  most  likely  due  to  initiation  of  translation  at  multiple 
in-frame  methionine  codons  downstream  of  the  most  5’  methionine  codon  (the  codon  purposely 
designed  to  be  the  initiator),  lower  the  signal-to-noise  ratio  in  the  gel  lanes.  Many  faint  mutant  bands 
are  likely  to  be  missed  due  to  overlap  with  normal  extra  bands.  A  related  problem  with  the  PTT  is  that 
when  an  abnormality  in  a  patient  sample  is  detected,  it  often  consists  of  more  than  one  mutant  band  in 
the  gel  lane.  Presumably,  the  length  of  the  mutant  polypeptide  with  the  highest  molecular  weight 
corresponds  to  the  distance  from  the  initiator  methionine  codon  (at  the  5’  end  of  the  PCR  product)  to 
the stop codon that signals a premature stop, and the shorter mutant polypeptides are due to initiation of
translation  at  downstream  in-frame  methionine  codons  with  continued  translation  until  the  same  mutant 
premature  stop  is  encountered.  Therefore,  to  calculate  the  approximate  location  of  the  truncating 
mutation  in  the  2  kb  segment  being  screened,  one  must  find  the  mutant  band  of  highest  molecular 
weight.  Since  this  band  is  often  obscured  by  normal  “extra  bands,”  with  this  strategy  one  will  often 
choose  the  wrong  300-500  bp  section  of  the  2  kb  segment  to  sequence  in  hunting  for  the  mutation  at 
the  sequence  level. 
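
The localization step can be sketched numerically. The conversion below assumes the common rule of thumb of about 110 Da per amino acid residue, and the band sizes are hypothetical; neither comes from our gels. It shows how reading a shorter band (one produced by initiation at a downstream methionine) as if it were the highest-molecular-weight mutant band shifts the estimated mutation site by several hundred bases.

```python
# Rule-of-thumb average residue mass; an approximation, not a measured value.
AVG_RESIDUE_MASS_DA = 110.0


def estimated_stop_offset_bases(band_kda: float) -> int:
    """Approximate distance (in bases) from the initiator codon to the premature stop."""
    residues = band_kda * 1000.0 / AVG_RESIDUE_MASS_DA
    return round(residues * 3)


# Hypothetical mutant bands in one lane of a 2 kb segment.
true_top_band_kda = 44.0      # initiated at the engineered 5' methionine
shorter_band_kda = 33.0       # initiated at a downstream in-frame methionine

correct = estimated_stop_offset_bases(true_top_band_kda)
misread = estimated_stop_offset_bases(shorter_band_kda)
print(correct, misread, correct - misread)
```

With these illustrative numbers the two estimates differ by 300 bases, which is enough to direct sequencing to the wrong 300-500 bp section.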

1.1.2 Comprehensive screening for length abnormalities in the NF1 ORF in RT-PCR products
Fully 63% of the NF1 mutations reported to the NF1 Genetic Analysis Consortium (October 31, 1995)
should result in deletions or insertions of various sizes in the NF1 mRNA (115 mutations out of 182
total mutations reported). Essentially any length abnormality should be easily detected by
conventional agarose gel and polyacrylamide gel electrophoresis, with no “extra bands” expected from
control samples, and the overall sensitivity for detecting NF1 mutations by this approach
(63%) is expected to be similar to the overall sensitivity of the PTT approach (50-67%). We therefore
decided to explore a screening algorithm that would begin simply with the testing of NF1 mRNA for length
abnormalities. Let us refer to this approach as Length of ORF (LOO) screening. This algorithm would
look first for medium to large deletions and insertions, then for smaller deletions and insertions, down to
a single missing or extra base. Only those NF1 patient samples that pass these assays for length
abnormalities would then be subjected to assays sensitive to single base substitutions such as detection
of Single Strand Conformation Polymorphisms (SSCP), Restriction Endonuclease Fingerprinting
(REF), or the PTT.
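
The tiered logic of LOO screening can be written as a short sketch. The sample fields, the 50-base threshold separating the agarose and polyacrylamide tiers, and the function name are hypothetical choices made for illustration. The last tier is also idealized, since no single substitution-sensitive assay detects every base change.

```python
from typing import Callable, List, Optional, Tuple

# Each tier is (assay name, predicate on a sample). The sample fields and the
# 50-base agarose threshold are hypothetical stand-ins for real gel readouts.
Tier = Tuple[str, Callable[[dict], bool]]

TIERS: List[Tier] = [
    ("agarose gel (medium to large length changes)",
     lambda s: abs(s["length_change_bases"]) >= 50),
    ("denaturing PAGE (length changes down to one base)",
     lambda s: abs(s["length_change_bases"]) >= 1),
    ("substitution-sensitive assays (SSCP, REF, or PTT)",
     lambda s: s["base_substitution"]),
]


def first_positive_tier(sample: dict) -> Optional[str]:
    """Run the tiers in order; return the first assay that flags the sample."""
    for name, detects in TIERS:
        if detects(sample):
            return name
    return None


# Expected sensitivity of length screening, from the consortium counts in the text.
expected_loo_sensitivity = 115 / 182
print(f"{expected_loo_sensitivity:.0%}")

print(first_positive_tier({"length_change_bases": -300, "base_substitution": False}))
print(first_positive_tier({"length_change_bases": 1, "base_substitution": False}))
print(first_positive_tier({"length_change_bases": 0, "base_substitution": True}))
```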

1.2 Spectrum and significance of germline NF1 mutations (Technical Objective 2)
This project addressed the following questions regarding the NF1 mutations that cause
neurofibromatosis 1 disease. Do the germline NF1 mutations of patients with hereditary
neurofibromatosis 1 differ in nature or location from the constitutional NF1 mutations of sporadic (or
“new mutation”) NF1 patients? What fraction of unrelated NF1 patients can be shown to have an
abnormality of NF1 mRNA structure?

1.3 Spectrum and significance of somatically acquired mutations (Technical Objective 3)

This project addressed the following questions regarding the NF1 mutations of somatic cells that
contribute to the development of sporadic cancers in the general population (individuals without
neurofibromatosis 1 disease). In which types of sporadic tumors does somatic mutation of NF1 play a
role? Is the spectrum of somatic mutations in NF1 in sporadic tumors different from the spectrum of
germline mutations that cause neurofibromatosis 1 disease? Is the spectrum of somatic mutations in
NF1 different in different tumor types, suggesting different roles for NF1 in different tissues?

We revised the original strategy proposed in DAMD17-93-J-3044 to investigate the role of
somatic NF1 mutations in the development of sporadic tumors. Many tumor cell lines derived from
cancers of various types are available from cell and tissue banks. RNA can be readily prepared from
these cell lines, with no contamination from normal tissue, and the entire NF1 ORF rapidly screened.
Since somatic mutations in cell lines may arise during culture, the finding of an NF1 mutation in a tumor
cell line should not be considered definitive evidence that a somatic NF1 mutation contributed to that
tumor’s development in vivo. Nevertheless, it seems likely that the relative frequencies of NF1
mutations in different tumor types in vivo will be roughly reflected in cell lines established from tumors
of those types. Thus, the data collected from surveying tumor cell lines for NF1 mutations should allow
us to prioritize our subsequent searches for NF1 mutations in never-cultured tumor specimens.
Furthermore, if NF1 mutations found in cell lines representing a particular tumor type turned out to be
restricted to a particular segment of the gene, subsequent efforts to screen tumor specimens of that type
could reasonably be focused on that segment. Finally, totally negative results obtained from screening
cell lines from a particular tumor type would justify omitting the corresponding tumor specimens from
subsequent screens.

2.  Materials  and  Methods 

2.1  Cell  lines 

Epstein-Barr virus-transformed lymphoblast lines from NF1 patients were provided by Dr. Ray White
and Dr. Ken Ward of the University of Utah. Lymphoblast cell lines were grown in RPMI-1640
medium with 15% fetal bovine serum (FBS) in a humidified 37°C incubator in the presence of 5% CO2.
Tumor cell lines from ATCC (see Table 3) were grown as recommended by ATCC. Glioblastoma cell
lines JBSA, PFAT(1), and PFGC(3) were grown in Dr. Fults’ laboratory in DMEM + 10% FBS in the
presence of 10% CO2. Colon cancer cell lines were grown in Dr. Markowitz’s laboratory, as described
in ref 12 and its citations.

2.2  RNA  preparation  and  first  strand  cDNA  synthesis 

Total RNA preparation from all cell lines and whole blood, and first strand cDNA synthesis from total
RNA, were as described in reference 9 (included in the Appendix), except that RNA preparation from
the colon cancer cell lines was performed in Dr. Markowitz’s laboratory as described in ref 12.

2.3 Long RT-PCR of the entire NF1 Open Reading Frame (ORF)
This  method  is  presented  in  reference  9  (see  Appendix). 

2.4 Restriction digestion of long NF1 RT-PCR product and analysis by agarose gel electrophoresis
This  method  is  presented  in  reference  9  (see  Appendix). 

2.5 PCR of NF1 ORF as five products suitable for mutation screening by multiple methods

Five overlapping, approximately 2 kb long PCR products were synthesized, using the 8.7 kb NF1
RT-PCR product as template. These 2 kb PCR products are called TnT products because they are also
suitable for Transcription and Translation studies. Table 1 shows the sequences of these primers and
the sizes of the products.

For each of the five products, the template for PCR was 1 µl of a 1:10 dilution in water of the
8.7 kb long NF1 RT-PCR product. The final components of the PCR were 0.2 mM dNTPs, 1x XL
PCR buffer (Perkin-Elmer), 1.1 mM Mg(OAc)2, 0.25 mM spermidine, 0.4 µM of each primer, and 1
unit of Taq DNA polymerase in a final volume of 20 µl. A hot start procedure was followed, using a
Perkin-Elmer 9600 thermal cycler: 94°C x 1.5’; 80°C hold while enzyme is added; (94°C x 30", 54°C x
30", 65°C x 4’) x 12 cycles; 72°C x 10’.
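
As a convenience for planning bench time, the cycling profile above can be written as data and summed. This is only a sketch: the variable and function names are our own, and the total counts programmed hold times only, ignoring ramp times and the manual 80°C hold during enzyme addition.

```python
# Each step is (label, temperature in deg C, hold time in seconds).
INITIAL = [("initial denaturation", 94, 90)]
CYCLE = [("denature", 94, 30), ("anneal", 54, 30), ("extend", 65, 240)]
FINAL = [("final extension", 72, 600)]
N_CYCLES = 12


def programmed_seconds(initial, cycle, n_cycles, final):
    """Sum of programmed hold times; ramping and manual holds are not included."""
    total = sum(seconds for _, _, seconds in initial)
    total += n_cycles * sum(seconds for _, _, seconds in cycle)
    total += sum(seconds for _, _, seconds in final)
    return total


total = programmed_seconds(INITIAL, CYCLE, N_CYCLES, FINAL)
print(f"{total} s = {total / 60} min of programmed hold time")
```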

After PCR, the entire volume of each TnT product was loaded into a single well of a 1-2%
agarose gel in a Tris-acetate-EDTA buffer system with ethidium bromide in both the gel and the buffer,
and electrophoresis was carried out until the product had migrated 1-2 cm into the gel. The bands
containing the TnT products were then visualized over a long wave UV light box and excised with a
razor blade. The DNA was extracted from the gel slices by the GeneClean II method (Bio 101),
according to the manufacturer’s instructions. Final elutions were into 10 µl of water.

To radiolabel one strand of each TnT product, the purified products were subjected to one
round of denaturation, annealing, and extension in the presence of ^^P-dATP and one of the primers
from the original 2 kb PCR reaction. For TnT1, TnT2, TnT4, and TnT5 the downstream primer was
used for this reaction; for TnT3 the upstream primer was used. The components in the final reaction
mixture were 2.5 µl of GeneClean II-purified DNA, 1x XL PCR buffer (Perkin-Elmer), 0.02 mM
dNTPs (10-fold lower than for a normal PCR), 1.1 mM Mg(OAc)2, 0.25 mM spermidine, 0.5 units of
Taq DNA polymerase, 0.4 µM primer, and 0.8 µl ^^P-dATP (1000-3000 Ci/mmol, 10 mCi/ml, Andotek
Life Sciences), in a final volume of 10 µl. The sample was denatured at 94°C x 30”, annealed at 54°C x
30”, and extended at 65°C x 4’ (one cycle only).

2.6 Restriction digestion of the five products and analysis by polyacrylamide gel electrophoresis
All restriction endonuclease digestions were performed by sequential additions of reagents to the 10 µl
radiolabeled samples synthesized in the step immediately preceding this section. All digestions were
performed in the presence of 4 mM spermidine. Reaction components and steps specific to each
digestion are given below.

2.6.1 Restriction enzyme digestion of TnT1

Add 5 units of Mam I (Boehringer-Mannheim, or BM), 5 units of Sfu I (BM), and 1.5 µl of 10x Sfu I
buffer (BM) to a final volume of 15 µl; incubate at 37°C x 1 hour. Then add 5 units of Bsr I in a 1.0 µl
volume (New England Biolabs, or NEB) and incubate at 65°C for 1 hour.

2.6.2  Restriction  enzyme  digestion  of  TnT2 

Add 5 units of Msp I (GIBCO-BRL) and 1.5 µl of 10x Msp I buffer to a final volume of 15 µl; incubate
at 37°C x 1 hour. Then add 5 units of Ava II (GIBCO-BRL), and NaCl to a final concentration of 50
mM, in a final volume of 16.4 µl; incubate at 37°C x 1 hour.

2.6.3  Restriction  enzyme  digestion  of  TnT3 

Add 5 units of Hae III (GIBCO-BRL) and 1.5 µl of 10x Hae III buffer to a final volume of 15 µl; incubate
at 37°C x 1 hour. Then add 5 units of EcoRI (GIBCO-BRL), and NaCl to a final concentration of 50
mM, in a final volume of 16 µl; incubate at 37°C x 1 hour.

2.6.4 Restriction enzyme digestion of TnT4

Add 5 units of Cac8 I (NEB), 1.5 µl of 10x Cac8 I buffer, and 6 units of Pst I (NEB) to a final volume of
14.1 µl; incubate at 37°C x 1 hour. Then add 2.5 µl containing 5 units of Tfi I (NEB); incubate at 65°C
x 1 hour.

2.6.5  Restriction  enzyme  digestion  of  TnT5 

Add 5 units of Rsa I (GIBCO-BRL) and 1.5 µl of 10x Rsa I buffer to a final volume of 15 µl; incubate
at 37°C x 1 hour. Then add 5 units of Ava I (GIBCO-BRL), and NaCl to a final concentration of 50
mM; incubate at 37°C x 1 hour.

2.6.6  Fill-in  of  restriction  fragment  ends 

We found that while the expected radiolabeled single-stranded DNA restriction fragments were
observed upon denaturing polyacrylamide gel electrophoresis, each fragment was accompanied by
additional fragments one or more bases longer than the expected length. This was prevented by
completely removing unincorporated dNTPs following the radiolabeling step, but prior to the addition
of restriction enzymes; with this modification, only the shortest (i.e. the expected) version of each
restriction fragment was observed. Our interpretation, therefore, is that one or more fragments longer
than the one expected from restriction digestion were generated by the addition of dNTPs to the
exposed 3’ ends of the restriction fragments by the action of Taq polymerase, which remained present
and active in the samples. This occurred even for blunt-end restriction fragments, due to Taq’s
well-known ability to add an extra 3’-overhanging base to double-stranded DNA. However, these base
additions did not go to completion, probably because the optimal reaction conditions for restriction
digestion were sub-optimal for dNTP incorporation by Taq polymerase. Therefore, mixed-length
populations of each restriction fragment resulted, which would greatly complicate detection of mutant
bands on gels.

As an alternative to completely removing unincorporated dNTPs prior to restriction digestion,
we chose to optimize the fill-in and untemplated base addition reactions of Taq polymerase in order to
obtain populations of restriction fragments of uniform length (from controls). Following restriction
digestion, described above, 400 µl of low-EDTA TE buffer (10 mM Tris, 0.1 mM EDTA) was added to
each sample. The samples were then loaded onto Microcon-50 spin columns (Amicon, Inc.) and
centrifuged at 14,000 x g for 6 minutes. This buffer exchange procedure left the restriction fragments
in 5 µl of the same TE buffer. The samples were then brought to a 10 µl final volume containing
0.25 units of Taq polymerase, 0.1 mM dNTPs, 1x XL PCR buffer (Perkin-Elmer), and 1.1 mM Mg(OAc)2,
and incubated at 65°C for 10-15 minutes.

2.6.7  Polyacrylamide  gel  electrophoresis  under  denaturing  conditions 

Each sample was analyzed on two different polyacrylamide slab gels, one optimized for resolving the
larger-sized fragments, the other optimized for resolving smaller fragments; the BRL S2 electrophoresis
apparatus was used for all gels, which were cast 38 cm long each. Radiolabeled samples were diluted
1:1 with a solution containing 95% formamide, 20 mM trisodium EDTA (pH approx. 8.8), 0.05%
bromphenol blue and 0.05% xylene cyanol. The samples were then heated at 90°C for 3 minutes (or
80°C for 5 minutes) to denature the DNA, and three to five µl were loaded per gel lane. Gel One (for
better resolution of the larger fragments) was a 4.75% sequencing gel made with Sequagel-4.75
(National Diagnostics), according to the manufacturer’s instructions; electrophoresis was carried out in
the presence of an electrolyte gradient. At the beginning of the run, the top buffer chamber contained
0.5x TBE buffer, and the bottom chamber contained 1x TBE. After electrophoresis at 85 W constant
power for 2 hours, the power was turned off and the bottom buffer was brought to 1 M NaAc by the
addition of 1/2 volume of 3 M NaAc; electrophoresis was then continued for 2 additional hours at 85 W.
Gel Two (for better resolution of the smaller fragments) was a conventional 6% gel made with
Sequagel-6 (National Diagnostics). Both top and bottom buffer chambers contained 1x TBE, and
electrophoresis was for 2 hours at 85 W. After electrophoresis, one glass plate was removed, and the
gels were transferred to 3MM Whatman blotting paper, covered with plastic wrap, and dried over heat
under vacuum on a Hoefer slab gel dryer. The plastic wrap was then removed, the dried gel
exposed to Kodak X-Omat Scientific Imaging Film at -80°C in the dark for 1-5 days, and the exposed
film developed in a Kodak film developer.

3.  Results  and  Discussion 

3.1 Improving mutation detection in the NF1 gene (Technical Objective 1)

3.1.1 Detection of large scale deletions in NF1 mRNA

Our recent report on long RT-PCR of the entire NF1 ORF and analysis of the products on agarose gels,
with and without restriction enzyme digestion (see Appendix), demonstrated the utility of this approach
in detecting large scale, multi-exon deletions in NF1 mRNA. In principle this technique should be able
to detect even mutant NF1 mRNAs resulting from genomic DNA deletions > 100 kb in size; the only
requirements are that the deletion must lie between the sites where the PCR primers bind (exon 1 and
49), and a stable mRNA must be made.

3.1.2 Detection of small scale deletions and insertions in NF1 mRNA

Figure 1 illustrates the strategy for detecting small deletions and insertions in the NF1 ORF using cDNA
as template. First, five overlapping, approximately 2 kb PCR products were synthesized, using the long
NF1 RT-PCR product as template. These 2 kb PCR products (called TnT products because they are
also suitable for Transcription and Translation studies) were then subjected briefly to agarose gel
electrophoresis, followed by excision and purification from the agarose to remove unincorporated PCR
primers. Next, one strand was radiolabeled by performing a single round of denaturation, annealing,
and extension with ^^P-dATP present, using just one of the two primers present during the original 2 kb
PCR. The resulting radiolabeled products were then cut with restriction enzymes, carefully selected for
each specific TnT product based on its known restriction map, and subjected to electrophoresis in
denaturing polyacrylamide gels. Figure 1 shows scanned autoradiographs of gel lanes containing
restriction-digested TnT products from normal segments of the NF1 ORF.
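
The expected banding pattern, and the way a small deletion shifts exactly one band, can be sketched as follows. The 2000-base product length and the cut positions are hypothetical and do not correspond to any real TnT restriction map. The sketch also assumes the deletion does not remove or create a cut site.

```python
from typing import List, Tuple


def fragment_lengths(product_len: int, cut_sites: List[int]) -> List[int]:
    """Lengths of labeled-strand fragments, listed 5' to 3'.

    Each cut site is given as the number of bases lying 5' of the cut.
    """
    bounds = [0] + sorted(cut_sites) + [product_len]
    return [right - left for left, right in zip(bounds, bounds[1:])]


def delete_bases(product_len: int, cut_sites: List[int],
                 after_base: int, n_bases: int) -> Tuple[int, List[int]]:
    """Remove n_bases immediately 3' of position after_base; shift downstream cuts."""
    deleted_end = after_base + n_bases
    if deleted_end > product_len:
        raise ValueError("deletion runs past the end of the product")
    if any(after_base < cut < deleted_end for cut in cut_sites):
        raise ValueError("deletion removes a cut site; not modeled in this sketch")
    shifted = [cut if cut <= after_base else cut - n_bases for cut in cut_sites]
    return product_len - n_bases, shifted


# Hypothetical 2 kb product with three cut sites.
normal = fragment_lengths(2000, [300, 750, 1400])
mutant_len, mutant_sites = delete_bases(2000, [300, 750, 1400], after_base=500, n_bases=1)
mutant = fragment_lengths(mutant_len, mutant_sites)
print(normal)
print(mutant)
```

In a heterozygous sample both patterns are present, so the lane would show the normal bands plus one extra band a single base shorter than its normal counterpart.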

To  test  whether  single  base  length  differences  were  being  resolved,  two  sets  of  size  standards 
differing  in  length  by  one  base  were  used.  The  first  set  consisted  of  two  PCR  products,  449  bp  and  448 
bp  long,  identical  in  sequence  except  for  the  presence  of  an  extra  base  at  one  end  of  the  longer  product. 
Radiolabeled  in  one  strand  only,  these  products  were  routinely  loaded  separately  and  as  a  mixture  on 
each  4.75%  gel.  In  Figure  2  these  standards  are  shown  after  migrating  approximately  halfway  through 
the  gel  (i.e.  after  migrating  approx.  19  cm).  This  scanned  autoradiograph  has  lower  resolution  than  the 
actual  film,  which  does  allow  two  distinct  bands  to  be  seen  in  the  lane  containing  the  mixture. 

The  second  set  of  size  standards  for  testing  the  resolution  of  single  base  length  differences  was 
generated  by  performing  the  restriction  digestion  of  the  radiolabeled  TnT3  product  in  the  absence  of 
dNTPs  and,  separately,  in  the  presence  of  dNTPs  and  Taq  DNA  polymerase.  Figure  3,  lane  a  shows 
two  fragments  from  TnT3  after  digestion  in  the  absence  of  dNTPs;  lane  b  shows  the  same  fragments 
after  digestion  in  the  presence  of  dNTPs  and  Taq  polymerase;  and  lane  c  shows  the  mixture  of  the  two 
sets of digestion products. Both of these restriction cuts (with Hae III) should generate blunt ends;
therefore, Taq is not filling in recessed 3’ ends here. Rather, in the samples loaded in lane b, it has
added  a  single  extra  base  to  the  exposed  3’  terminus  of  the  radiolabeled  strand.  This  result 
demonstrates  that  restriction  fragments  differing  in  length  by  one  base  are  resolved  as  distinguishable 
bands  in  this  region  of  this  gel.  The  441,  324,  320,  257,  200,  159,  and  149  bp  restriction  fragments 
from  TnT3  are  all  blunt-ended  at  the  3’  end  of  the  radiolabeled  strand,  and  so  are  suitable  for  testing 
single  base  length  difference  resolution  by  this  approach. 

3.1.3 Detection of single base substitutions in NF1 mRNA

This project did not have sufficient time to perform comprehensive screening for single base
substitutions in NF1 mRNA, but the five TnT RT-PCR products that we have designed are suitable for
such studies. First, the radiolabeled restriction fragments from each digested TnT product can be
heat-denatured, then subjected to SSCP and/or heteroduplex analyses in non-denaturing polyacrylamide gels
with and without glycerol (see protocols from the AT Biochem division of FMC Corporation).
Alternatively, or in addition, the TnT products can be used for the Protein Truncation Test, since the 5’
ends of each TnT product have been designed to allow transcription and subsequent translation of the
transcripts. Furthermore, the 3’ ends of each TnT product have been designed so that all synthesized
polypeptides that reach the end of the normal reading frame should acquire a run of histidine residues
that in principle should allow their removal on a nickel-based affinity matrix. The prematurely
terminated mutant polypeptides, which should not bind this matrix, can then be eluted and analyzed by
SDS-PAGE. This approach should tremendously increase the signal-to-noise ratio for mutation
detection by the PTT. (Note that we have introduced a single base change near the 3’ end of TnT5 so
that protein made from the normal NF1 sequence will not terminate at the normal NF1 stop codon, but
will read through and still acquire the primer-encoded histidines.) However, initial efforts to remove
full-length proteins by this approach have been unsuccessful (personal communication, Dr. David
Viskochil). An alternative approach, yet to be explored to our knowledge, would use antibodies to
some other specific, PCR-primer-encoded carboxy-terminal peptide in an entirely analogous scheme to
remove the proteins that are synthesized based on the normal NF1 mRNA.

3.2 Spectrum and significance of germline NF1 mutations (Technical Objective 2)

Our  data  from  studies  of  germline  NFl  mutations  are  summarized  in  Table  2.  RNA  samples  from  20 
unrelated  patients  with  hereditary  neurofibromatosis  1  and  13  unrelated  patients  with  sporadic  (or  “new 
mutation”)  neurofibromatosis  1  were  analyzed.  By  performing  the  long  RT-PCR  of  NFl  mRNA  only 
on  patients  already  known  (by  testing  of  genomic  DNA)  to  be  heterozygous  at  bp  702  in  exon  5,  we 
produced  RT-PCR  products  that  could  be  tested  for  representation  of  both  alleles.  All  of  the  long  RT- 
PCR  products  prepared  from  these  patients  showed  heterozygosity  at  bp  702  by  an  Rsal  restriction 
enz5mie  digestion-based  assay.  The  33  NFl  patients  were  screened  for  length  abnormalities  occurring 
anywhere  in  the  8.5  kb  NFl  ORF,  by  the  methods  described  above.  Six  of  the  33  showed  abnormalities 
in  the  Asel  +  Avill  restriction  digestion  products  released  from  the  8.7  kb  long  RT-PCR  product.  Each 
of  these,  upon  sequencing,  turned  out  to  be  due  to  a  single  exon  drop-out,  except  for  the  mutation  of 
patient  1 19-1,  which  was  a  30  bp  deletion  from  exon  9.  Eleven  additional  patients  showed 
abnormalities  when  the  five  TnT  products  were  digested  with  segment-specific  restriction  enzymes  and 
analyzed  by  PAGE  under  denaturing  conditions.  This  gives  a  total  detection  rate,  using  length-of-ORF 
(LOO) screening, of 17/33 (about 52%). This is slightly higher than our detection rate with the Protein
Truncation Test alone, 11/23 (about 48%). Looking only at the 23 patients who were screened by both
LOO and PTT gives an overall detection rate of 19/23 (about 83%). We think it likely that further
screening of these NF1 RNA samples by other methods will identify mutations in some or all of
the patients showing no abnormalities so far. For example, single base substitution missense mutations
that change an amino acid are known to be a cause of neurofibromatosis 1 disease, but would be
systematically missed by both LOO and PTT. Although my laboratory will not be able to perform any
further work to confirm and further define at the sequence level the abnormalities reported here, nor be
able to search for additional mutations in this patient set by other methods, due to my terminating all my
work  related  to  the  NFl  gene,  I  will  give  a  copy  of  this  report  and  all  NFl  patient  samples  to  Dr.  David 
Viskochil  (who  continues  to  work  on  neurofibromatosis  1)  at  the  University  of  Utah,  so  that  he  may 
further  investigate  the  mutations  in  these  patients. 
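
The detection rates quoted in this paragraph follow directly from the patient counts. As an arithmetic check only, the short Python sketch below recomputes them; the function and variable names are illustrative and are not part of the original analysis.

```python
from fractions import Fraction


def detection_rate(detected: int, screened: int) -> Fraction:
    """Return the detection rate as an exact percentage."""
    return 100 * Fraction(detected, screened)


# Counts taken from the text.
long_rtpcr_hits = 6    # abnormal digests of the long RT-PCR product
tnt_digest_hits = 11   # additional abnormal digests of the TnT products
loo_hits = long_rtpcr_hits + tnt_digest_hits

rates = {
    "LOO": detection_rate(loo_hits, 33),
    "PTT": detection_rate(11, 23),
    "LOO and PTT combined": detection_rate(19, 23),
}

for label, rate in rates.items():
    print(f"{label}: {float(rate):.1f}%")
```

Keeping the rates as exact fractions until the final print avoids rounding twice.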

Sub-localizing the site of a mutation's underlying sequence change within the affected 2 kb TnT
RT-PCR product, based only on an analysis of the mutant band first seen by denaturing PAGE of that
TnT product's restriction digest, is frequently not straightforward. If, in a gel lane, the band
corresponding to a particular normal restriction fragment is clearly diminished in intensity relative to
other normal bands, then the mutation must affect that restriction fragment in the mutant allele. Thus,
one  looks  for  a  new  band  of  abnormal  mobility,  but  also  for  changes  in  the  intensity  of  bands  of  normal 
mobility. Unfortunately, more than 80% of NF1 patients show unequal expression from the two NF1
alleles, and in our experience, in every case where we identified a patient's mutation in RNA and
unequal expression was present, the mRNA from the mutant allele was diminished in abundance relative
to that from the normal allele. Therefore, in many cases a faint mutant band of abnormal mobility is
found in these LOO screens, but no obvious diminution in the intensity of a normal band accompanies it
to help narrow the site of mutation, because the percentage change in signal intensity is small. Nevertheless, the
screening  of  the  TnT  digestion  products  achieves  the  narrowing  of  the  mutation  location  to  one  2  kb 
segment  of  the  8.5  kb  ORF.  Our  approach  is  then  to  prepare  four  or  five  overlapping  PCR  products 
(each  400-500  bp  long)  spanning  the  2  kb  segment,  analyze  them  on  agarose  gels  (and,  if  necessary, 
polyacrylamide  gels),  identify  the  new  mutant  product,  and  sequence  it  directly. 
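
The tiling step described above can be sketched in a few lines of Python. This is only an illustration: the 50 bp overlap, the exact 2000 bp segment length, and the 0-based half-open coordinates are assumptions chosen for the example, not values taken from the protocol.

```python
import math


def tile_segment(segment_len: int, n_products: int, overlap: int):
    """Split a segment into n overlapping amplicons of (near) equal length.

    Returns 0-based, half-open (start, end) coordinates.
    """
    # n products of length L that overlap by `overlap` cover
    # n * L - (n - 1) * overlap bases, so solve for L and round up.
    length = math.ceil((segment_len + (n_products - 1) * overlap) / n_products)
    step = length - overlap
    tiles = []
    for i in range(n_products):
        start = i * step
        end = min(start + length, segment_len)  # do not run past the segment
        tiles.append((start, end))
    return tiles


# Hypothetical example: five products with 50 bp overlaps across 2000 bp.
tiles = tile_segment(2000, 5, 50)
for start, end in tiles:
    print(start, end, end - start)
```

With these assumed values each product is 440 bp long, which falls inside the 400-500 bp range given in the text.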

The  problem  of  detecting  single  base  deletions  and  insertions  deserves  special  attention.  In 
Table  2,  patient  92-5  is  the  only  patient  known  to  have  a  single  bp  deletion,  and  it  was  not  detected  by 
the  LOO  approach.  We  suspect  that  many  single  base  deletions  and  insertions  may  be  missed  on  these 
gels,  especially  in  the  higher  molecular  weight  restriction  fragments.  As  suggested  by  Figure  2,  single 
base  length  differences  in  these  relatively  large  fragments  are  only  barely  discernible.  Since  more  peak- 
to-peak  separation  is  required  to  distinguish  two  bands  of  unequal  intensity  than  to  distinguish  two 
bands  of  equal  intensity,  sensitive  screening  of  NFl  for  single  base  deletions  and  insertions,  when 
unequal  expression  is  the  rule,  will  be  difficult  to  achieve  by  this  approach.  Ways  around  this  problem, 
which  we  have  not  had  time  to  investigate,  include  using  other  sets  of  restriction  enzymes,  in  addition  to 
those  recommended  here,  to  cut  the  2  kb  PCR  products,  so  that  the  segments  previously  located  in  the 
largest  restriction  fragments  will  now  be  in  smaller  fragments,  for  which  resolution  is  better.  Another 
solution  is  to  routinely  heat  and  cool  the  digested  products  to  drive  heteroduplex  formation,  then 
subject  them  to  PAGE  on  non-denaturing  gels.  Heteroduplex-based  mutation  detection  is  thought  to  be 
nearly  100%  sensitive  in  the  detection  of  single  base  deletions  and  insertions. 
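
The argument for re-cutting with a different enzyme set can be illustrated numerically. The Python sketch below uses hypothetical cut-site positions (not real NF1 restriction maps) and treats the fractional length change caused by a 1 bp deletion as a rough proxy for how easily it can be resolved; that proxy is an assumption made for illustration, not a model of gel mobility.

```python
from bisect import bisect_right


def fragments(product_len: int, cut_sites):
    """Return (start, end) pairs of the fragments produced by the cut sites."""
    bounds = [0] + sorted(cut_sites) + [product_len]
    return list(zip(bounds[:-1], bounds[1:]))


def fragment_containing(product_len: int, cut_sites, pos: int):
    """Return the (start, end) fragment that contains position `pos`."""
    cuts = sorted(cut_sites)
    i = bisect_right(cuts, pos)
    start = cuts[i - 1] if i > 0 else 0
    end = cuts[i] if i < len(cuts) else product_len
    return start, end


def relative_shift(fragment_len: int, indel: int = 1) -> float:
    """Fractional length change caused by an indel of `indel` bases."""
    return indel / fragment_len


# Hypothetical 2 kb product with a 1 bp deletion at position 700.
product_len, deletion_pos = 2000, 700
enzyme_set_a = [300, 1100, 1500]       # assumed cut sites
enzyme_set_b = [250, 600, 900, 1400]   # assumed alternative cut sites

for name, cuts in (("A", enzyme_set_a), ("B", enzyme_set_b)):
    start, end = fragment_containing(product_len, cuts, deletion_pos)
    size = end - start
    print(f"set {name}: fragment {start}-{end} ({size} bp), "
          f"relative shift {relative_shift(size):.4f}")
```

In this toy case the deletion lies in an 800 bp fragment with set A but in a 300 bp fragment with set B, so the same single-base change is a larger fraction of the fragment after re-cutting.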

The  germline  NFl  mutations  of  patients  with  hereditary  neurofibromatosis  1  did  not  clearly 
differ  in  nature  or  location  from  the  constitutional  NFl  mutations  of  sporadic  (or  “new  mutation”)  NFl 
patients,  in  this  patient  set. 

3.3 Spectrum and significance of somatically acquired NF1 mutations (Technical Objective 3)

Table  3  summarizes  the  results  of  screening  for  somatic  mutations  in  the  NFl  gene  in  several  tumor  cell 
lines  established  from  sporadic  cancers  of  various  types  arising  in  individuals  without 
neurofibromatosis 1 disease. Unfortunately, there was not sufficient time to confirm and further define
many  of  these  apparent  mutations  at  the  sequence  level.  In  interpreting  these  results  one  should  keep  in 
mind  the  possibility  that  somatic  NFl  mutations  may  occur  in  0,  1,  or  2  alleles  in  any  given  tumor  cell 
line  (unlike  the  case  for  constitutional  NFl  mutations  in  patients  with  neurofibromatosis  1).  For 
example,  breast  adenocarcinoma  MCF7  appeared  to  have  abnormalities  in  at  least  two  well-separated 
segments  of  the  NFl  mRNA;  this  is  consistent  with  the  possibility  that  one  mutation  is  present  in  mRNA 
from one allele and a different mutation is present in mRNA from the other allele.

Anomalous bands on gels and/or sequence changes suggestive of abnormal NF1 function were
found in 2/2 breast carcinoma cell lines, 7/17 colon cancer cell lines, 5/6 glioblastoma multiforme cell
lines, and 1/2 neuroblastoma cell lines. RNA from these cell lines has been screened to date only by
the  Length  of  ORF  screen.  Exon  dropouts  were  confirmed  at  the  cDNA  sequence  level  in  three  colon 
cancer  cell  lines  and  one  glioblastoma  multiforme  cell  line.  (It  should  be  noted  that  the  colon  cancer  cell 
lines  selected  for  NFl  screening  were  specifically  chosen  to  be  lines  that  were  negative  for  activating 
mutations  of  the  K-ras  protooncogene.)  Although  my  laboratory  will  not  be  able  to  perform  any  further 
work to confirm and further define at the sequence level the abnormalities reported here (due to my
terminating  all  my  work  related  to  the  NFl  gene),  I  will  notify  Drs.  Markowitz  and  Fults  of  these  results 
so  that  they  can  pursue  these  findings. 

These results suggest that NF1 undergoes somatic mutation frequently and contributes to a
variety  of  tumor  types.  Clearly,  much  more  screening  is  needed  to  get  reliable  estimates  of  NFl 
mutation  frequencies  in  tumor  cell  lines,  and  ultimately  in  fresh  tumor  specimens.  There  is  currently 
insufficient  data  to  determine  whether  the  spectrum  of  somatic  mutations  in  NFl  in  sporadic  tumors 
differs  significantly  from  the  spectrum  of  germline  mutations  that  cause  neurofibromatosis  1  disease,  and 
whether the spectrum of somatic mutations in NF1 varies significantly between tumor types, suggesting
different  roles  for  NFl  in  different  tissues. 

4.  Conclusions 

This  research  project  had  three  technical  objectives.  Achieving  the  second  and  third  objectives  with 
large  numbers  of  samples  depends  strongly  on  achieving  the  first  objective.  Because  of  the  paramount 
importance  of  developing  efficient  mutation  detection  in  the  NFl  gene,  so  that  other  investigators  will 
be  able  to  make  rapid  progress  in  several  areas  of  NFl  research,  I  chose  to  focus  nearly  all  my  effort  on 
the  first  technical  objective.  I  believe  the  Length  of  ORF  assay  that  I  have  developed  is  a  major 
contribution  to  the  field.  If  this  assay  is  applied  first,  followed  by  assays  for  single  base  substitution 
mutations, it is likely that more than 90% of all neurofibromatosis 1-causing mutations can be
identified  rapidly.  Somatic  mutations  in  NFl  appear  to  be  common  in  the  tumor  types  investigated  here. 
Further  work  by  others  will  be  needed  to  provide  an  accurate  assessment  of  the  frequency  and 
significance  of  somatic  NFl  mutations  in  human  tumors.  One  hopes  that  eventually,  an  assay  that  tests 
the  integrity  of  all  neurofibromin  functions  relevant  to  neurofibromatosis  1  disease  and  spontaneous 
tumorigenesis  will  be  developed  that  obviates  the  types  of  mutation  screening  that  I  have  developed. 
Neurofibromatosis 1 maps to chromosome band 17q11.2, and the NF1 locus has been partially
characterized. Even though the full-length NF1 cDNA has been sequenced, the complete genomic
structure of the NF1 gene has not been elucidated. The 5' end of NF1 is embedded in a CpG island
containing a NotI restriction site, and the remainder of the gene lies in the adjacent 350-kb NotI
fragment. In our efforts to develop a comprehensive screen for NF1 mutations, we have isolated
genomic DNA clones that together harbor the entire NF1 cDNA sequence. We have identified all
intron-exon boundaries of the coding region and established that it is composed of 59 exons.
Furthermore, we have defined the 3'-untranslated region (3'-UTR) of the NF1 gene; it spans
approximately 3.5 kb of genomic DNA sequence and is continuous with the stop codon.
Oligonucleotide primer pairs synthesized from exon-flanking DNA sequences were used in the
polymerase chain reaction with cloned, chromosome 17-specific genomic DNA as template to amplify
NF1 exons 1 through 27b and the exon containing the 3'-UTR separately. This information should be
useful for implementing a comprehensive NF1 mutation screen using genomic DNA as template.
© 1995 Academic Press, Inc.


INTRODUCTION 

Neurofibromatosis type 1 (NF1) afflicts approximately 1 in 3500 individuals worldwide. This
autosomal dominant condition typically manifests cafe-au-lait macules in the first decade of life
and cutaneous neurofibromas during adolescence. Approximately 5 of afflicted individuals display
assorted medical complications involving many organ systems, and they carry a small but increased
risk for the development of malignancy. In addition to the highly pleiotropic nature of NF1, the
clinical manifestations are quite variable, even among family members who carry the same disease
allele. The NF1 gene was cloned by the mapping approach (Cawthon et al., 1990a; Viskochil et al.,
1990; Wallace et al., 1990), and its cDNA has been sequenced (Cawthon et al., 1990a; Wallace et al.,
1990; Xu et al., 1990a; Marchuk et al., 1991). The coding segment of NF1 spans approximately 350 kb
of genomic DNA, and the cDNA is 8454 bp long (Marchuk et al., 1991), exclusive of two alternative
splice forms that contain insertion exons of 54 (Cawthon et al., 1990a) and 63 (Nishi et al., 1991;
Suzuki et al., 1991) bp. NF1 encodes neurofibromin, a 2818-amino-acid polypeptide with domains of
homology to yeast IRA1 and IRA2 and to mammalian p120GAP (p21ras GTPase-activating protein)
(Buchberg et al., 1990; Xu et al., 1990a). The domain demonstrating homology to the catalytic
domain of p120GAP activates the intrinsic GTPase of p21ras in vitro (Ballester et al., 1990; Martin
et al., 1990; Xu et al., 1990b). Other functions of neurofibromin have not been conclusively
demonstrated.

Variable lengths of intronic sequence on both sides of NF1 exons 2 through 27b have been deposited
with GenBank under Accession Nos. U17656-U17690. Continuous DNA sequence spanning exons 28
through 49 has been deposited under Accession No. L05367.

A detailed understanding of gene structure at the NF1 locus is paramount in evaluating the
molecular basis of the high mutation rate of NF1, estimated to be one of the highest among human
genetic disorders (approximately 1/10,000 gametes per generation) (Carey et al., 1986). Even though
the NF1 cDNA sequence is known, precise identification of NF1 mutations in patients has not been
very fruitful; mutations have been defined in fewer than 20% of screened NF1 patients (NNFF
International NF1 Genetic Analysis Consortium, Fax: (617) 277-5933) and include translocations,
deletions, insertions, and point mutations that are generally “inactivating.” The identification of
exon boundaries and determination of intronic sequences flanking these boundaries would enable
investigators to adopt additional screening approaches to facilitate the further evaluation of
mutations in the NF1 gene.

The availability of cloned genomic DNA from the cosmid contig containing a portion of the NF1 cDNA
(Viskochil et al., 1990) enabled Cawthon et al. (1990a) initially to identify nine exon boundaries
with flanking intronic sequence. Subsequently, we used the same clones to define the boundaries of
downstream exons (this report), and these have been confirmed by DNA sequence generated by Weiss
et al. (1992). Finally, NF1 cDNAs mapping further 5' and extending centromeric of the cosmid contig
were used to isolate genomic clones to generate exon boundaries in the 5' portion of the gene. Based
on preliminary results, an exon numbering system was adopted by consensus at the NNFF International
Consortium on Gene Cloning and Gene Function for NF1 and NF2 (Oxford, UK, 1992), which redesignated
exon 1 reported by Cawthon et al. (1990a) as exon 28; the exon containing the stop codon was
designated exon 49. We report here the genomic organization of all NF1 exons, and we present exon
boundaries with flanking intronic sequences.

To complete the structural analysis of the NF1 gene, we characterized the 3' end of NF1 cDNAs. None
of the cDNA clones used originally to identify the NF1 gene included polyadenylation tracts that
typically mark the 3' end of mRNA transcripts. Furthermore, the 11- to 13-kb size of human NF1 mRNA
on Northern analysis (Buchberg et al., 1990; Viskochil et al., 1990; Wallace et al., 1990; Suzuki
et al., 1992) predicts that there is an additional 2.5 to 4 kb of noncoding sequence in the mature
NF1 transcript. One human cDNA clone from an oligo(dT)-primed library made from a basophilic
leukemia cell line was reported to contain the 3' end of NF1 (Suzuki et al., 1992); however, only a
segment of that clone, pHN-2, was sequenced, and the 3'-untranslated region was not reported.
Bernards et al. (1993) have cloned and sequenced mouse cDNAs that contain up to 3211 bp of DNA
beyond the mouse nf1 stop codon and have shown that the mouse 3'-UTR cDNA sequence is highly
homologous to the human genomic sequence lying immediately downstream of its NF1 stop codon. In
this report we define the 3' end of human NF1, and we demonstrate that the 3'-untranslated region
of NF1 mRNA from four different tissues is continuous with the stop codon in exon 49.

MATERIALS  AND  METHODS 

Isolation of genomic clones. NF1-specific genomic clones were selected from a cosmid library that
was constructed with DNA prepared from human peripheral blood leukocytes and inserted into the
BamHI restriction site of the pWE15 vector (Stratagene, La Jolla, CA). A premade P1-bacteriophage
human genomic library (The DuPont-Merck Pharmaceutical Co., Wilmington, DE, DMPC-HFF#1 Series B)
cloned into the BamHI site in the pAd10sacBII vector (Pierce and Sternberg, 1992) provided a second
source of genomic clones. High-density P1-bacteriophage libraries were screened as previously
described (O'Connell et al., 1990) using radioactive probes representing different NF1 cDNA
sequences. Hybridization on nylon membranes was carried out in 50% formamide, 5x SSC, 50 mM sodium
phosphate (pH 6.5), 2x Denhardt's solution, 1% dextran sulfate, 0.1% SDS, and 100 µg of denatured
total human DNA per milliliter for 14-16 h at 42°C. Filters were sequentially washed under
increasingly stringent conditions, with the final wash in 0.1x SSC, 0.1% SDS at 55°C for 5 min for
P1-bacteriophage libraries and at room temperature for 5 min for cosmid libraries. Positive clones
were identified by autoradiography and purified by subsequent screenings using similar methodology.
Isolated colonies were prepared by CsCl/ethidium bromide buoyant density centrifugation using
standard protocols (Pierce and Sternberg, 1992; Sambrook et al., 1989). Genomic clones were sized by
agarose gel electrophoresis of EcoRI- and BamHI-digested fragments both under standard conditions
(Sambrook et al., 1989) and using field-inversion methodology (Carle et al., 1986). The overlapping
ends of selected cosmid clones were mapped by Southern analysis using T7 and T3
polymerase-generated RNA probes synthesized from the ends of the clones.

cDNA screening. Premade Lambda ZAP cDNA libraries (Stratagene) were screened with PCR-labeled
probes as previously described (Xu et al., 1990a). Libraries constructed from four different tissue
sources were screened: fetal brain (random and oligo(dT) primed, Stratagene, 936206), fetal retina
from pooled tissue (oligo(dT) primed, Stratagene, 937202), muscle from an adult female (oligo(dT)
primed, Stratagene, 936215), and adult heart (random and oligo(dT) primed, Stratagene, 936207).
Isolated clones were excised according to the manufacturer's recommendations (Stratagene Protocol
200253), and the resultant NF1 cDNA inserts in pBluescript SK(-) vectors were sized and sequenced by
standard techniques.

Probe synthesis. PCR probes were synthesized by incorporating [α-32P]dCTP into a PCR product (Saiki
et al., 1988). Template DNA was obtained from clones representing NF1 cDNA between exons 2 and 27b.
PCR conditions for probe synthesis were as follows: 1-10 ng template DNA, 1.2 mM dATP/dTTP/dGTP,
2.5 µM dCTP, 10-15 µl of [α-32P]dCTP (Amersham, Arlington Heights, IL; sp act, >6000 Ci/mmol), 4 mM
MgCl2, 10 mM Tris (pH 8.3), 50 mM KCl, 0.5 mM primers, and 2.5 units Taq polymerase, in a total
volume of 50 µl. Routine thermal cycling included denaturation at 93°C, variable annealing
temperatures, and extension at 72°C for 1 min for 20 cycles. The final step included heat
denaturation at 93°C for 3 min, and the probe was placed on ice for at least 5 min before being
added (40 µl) to the hybridization solution.

RNA probes from the ends of cosmid inserts were synthesized according to the manufacturer's
instructions accompanying the T7 and T3 RNA polymerase transcription kit (Ambion, Austin, TX; No.
1326). Cosmid DNA (1-2 mg) digested with BsaI served as template for probe synthesis using
[α-32P]UTP (Amersham, sp act, 800 Ci/mmol). The probes were routinely used directly in Southern
analysis at template concentrations of 10 ng/ml hybridization solution.

FIG. 1. The NF1 locus. Genomic DNA is represented as a hatched bar; the length and transcriptional
direction of NF1 is depicted by the arrow above the region of genomic DNA that it spans. N denotes
NotI restriction endonuclease cleavage sites (Fountain et al., 1989; O'Connell et al., 1989). Cosmid
and P1 clones that cover all NF1 exons are shown below the map. Clones not previously reported are
indicated by thick bars. Breakpoints of two previously characterized chromosomal rearrangements,
t(1;17) and t(17;22) (O'Connell et al., 1990), are indicated by downward arrows above the genomic
fragment.

Identification of intron-exon boundaries. Initial characterization of exon boundaries and flanking
intronic sequences was performed in an empiric fashion by a PCR approach. Facing NF1 cDNA
oligonucleotides, tagged with universal primer sequence at the 5' ends (universal (21M13):
GTAAAACGACGGCCAGT; reverse (M13RP1): AACAGCTATGACCATG), were synthesized approximately 150 bp apart
according to the cDNA sequence. These primers were used to amplify DNA isolated from genomic
clones. PCR conditions varied for each primer pair; however, the routine reaction was based on the
manufacturer's recommendations for the respective Taq polymerase, and the annealing temperature was
set at approximately 5°C below the estimated Tm for 35 cycles. PCR products larger than the size
predicted on the basis of cDNA sequence presumably carried an intron, and each PCR product was
sequenced from both directions to identify divergence points from cDNA sequence. The presence of
consensus splice junctions at sites where the sequence of a PCR product diverged from known NF1
cDNA sequence identified the intron-exon boundaries. Initially, for those “introns” that did not
amplify, boundaries were obtained by direct DNA sequencing of subclones. In this case, DNA from
genomic clones was digested with Sau3A, subcloned in the BamHI site of pBluescript, and selected
with NF1 cDNA probes. Plasmid DNA from positive clones was directly sequenced using either the
universal or the reverse primer and T7 DNA polymerase (Sequenase Version 2.0, U.S. Biochemicals
(USB), Cleveland, OH). Later in the project other techniques replaced the subcloning protocols.
Those regions that could not be amplified by primer pairs 150 bp apart were subjected to either
anchored PCR (Roux and Dhanarajan, 1990) or inverse PCR (Ochman et al., 1988) techniques to obtain
exon boundaries and flanking intronic sequences. The PCR products were sequenced according to
established methods of automated dideoxy sequencing using fluorescently labeled primers and either
T7 or Taq polymerase.
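
The boundary-finding logic described under "Identification of intron-exon boundaries" can be illustrated with a toy example in Python. The sequences below are invented for illustration and are not NF1 sequence, and the function names are hypothetical; the sketch simply compares a cDNA with the corresponding genomic PCR product from both ends, takes the unmatched middle as the candidate intron, and checks it for the GT...AG consensus.

```python
def common_prefix_len(a: str, b: str) -> int:
    """Number of leading characters shared by a and b."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def locate_intron(cdna: str, genomic: str):
    """Return (start, end) of the genomic stretch absent from the cDNA.

    Reads inward from both ends, mimicking sequencing from both directions.
    Caveat: if the first intron bases happen to match the next exon, the
    apparent divergence point shifts, so results need manual review.
    """
    left = common_prefix_len(cdna, genomic)
    right = common_prefix_len(cdna[::-1], genomic[::-1])
    return left, len(genomic) - right


def has_consensus_splice_sites(intron: str) -> bool:
    """True if the candidate intron starts with GT and ends with AG."""
    return intron.startswith("GT") and intron.endswith("AG")


# Invented toy sequences (assumptions for illustration only).
exon_a = "ATGGCCAAC"
exon_b = "CTGACCTTT"
toy_intron = "GTAAGTCTTTTCCTTACAG"

cdna = exon_a + exon_b
genomic = exon_a + toy_intron + exon_b

start, end = locate_intron(cdna, genomic)
intron = genomic[start:end]
print(start, end, intron, has_consensus_splice_sites(intron))
```

A genomic product longer than the cDNA-predicted size, together with a GT...AG-bounded insertion at the divergence points, is the pattern the text uses to call an intron-exon boundary.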

DNA sequencing. Purified genomic DNAs from cosmid and P1 clones served as templates in routine PCR
using NF1-specific primers tagged with 21M13 (upstream primer) and M13RP1 (downstream primer). The
resulting PCR products were purified by Centricon-100 ultrafiltration (Amicon, Beverly, MA) and
sequenced using a test-site protocol provided by Applied Biosystems, Inc. (Foster City, CA). This
procedure involved performing dideoxy sequencing reactions with Taq polymerase in a thermal cycler,
using fluorescently tagged M13 universal (UP) or reverse (RP) sequencing primers, for analysis on an
Applied Biosystems Model 373A automated sequencer.
Subcloned genomic DNA in pBluescript was sequenced by the dideoxy method using either [α-32P]dATP or
[35S]dATP (Amersham, Arlington Heights, IL) and a Sequenase II kit with UP and RP sequencing primers
(USB). cDNA inserts in pBluescript vectors were sequenced with T7 polymerase (Sequenase Version 2.0,
USB) using fluorescently tagged UP and RP sequencing primers in an ABI protocol (Cawthon et al.,
1990a). Internal sequence was obtained manually using NF1-specific primers with a sequencing kit
(Sequenase Version 2.0, USB) and

RESULTS 

Physical map of the 5' end of the NF1 locus. We had previously generated a cosmid contig now known
to cover the portion of the NF1 locus harboring exons 28 through 49 (O'Connell et al., 1990;
Viskochil et al., 1990). The contig bounded by cosmids EVI36 and 7D5 is approximately 101 kb (Weiss
et al., 1992). The telomeric end of 7D5 maps 2.5 kb upstream of the telomeric NotI site of the
approximately 350-kb NotI fragment (R. Weiss, University of Utah, pers. comm., 1994). To
characterize the 5' end of the NF1 gene further, we used NF1 cDNAs as probes under stringent
hybridization conditions to isolate genomic clones mapping upstream of exon 28. As shown in Fig. 1,
the newly defined genomic contig consists of cosmids 30, 10, and 23 and the P1-bacteriophage clone
P1-9. An overlap of approximately 7.5 kb between the two contigs was established by Southern blot
analysis using RNA probes from the T7 end of cosmid EVI37 and the T3 end of cos23 (data not shown).
On the basis of Southern blot analysis, we estimate that the contig consisting of cos30, P1-9,
cos10, and cos23 is approximately 100 kb long. Cosmid HB5 (Jorde et al., 1993) maps to the
centromeric end of the 350-kb NotI fragment, and it extends approximately 20 kb telomeric of the
NotI site (Fig. 1). The difference between the estimated size of the NotI fragment and the amount of
genomic DNA contained in clones mapping between the NotI sites is about 115 kb; this gap lies
between clones HB5 and cos30.

Even though the cloned contig does not cover the entire gene, all of the NF1 exons are represented
in genomic clones. Exon 1 maps just 5' to the centromeric NotI site (Marchuk et al., 1992) and is
contained in cosmid HB5 (data not shown). Exon 2 maps to cos30 and P1-9 by Southern analysis and by
sequence identity of PCR amplification products. Thus, uncloned genomic DNA lying in the gap between
HB5 and cos30 is contained within intron 1.

NF1 exon boundaries. The cloning of genomic DNA encompassing all known NF1 exons provided substrate
for the delineation of exon boundaries and flanking intronic sequences. Using genomic clones from
the contig, four approaches enabled us to generate exon-selected template for sequencing: (1) direct
PCR using facing exon-based oligonucleotides as primers; (2) subcloning of Sau3A-digested DNA; (3)
inverse PCR with tail-to-tail, exon-based primer pairs (Ochman et al., 1988); and (4) “anchored PCR”
using NF1 cDNA-specific primers with a universal primer to amplify DNA fragments that have been
ligated to a restriction site-specific, universal anchor adapter primer (Roux and Dhanarajan, 1990).
The exon boundaries depicted in Fig. 2 designate sites where NF1 cDNA sequence diverged from genomic
DNA sequence. Around these sites of divergence, the sequences were typical of splice junctions.

In the course of this work several introns were identified after the adoption of the NF1 exon
numbering system (NNFF International Consortium on Gene Cloning and Gene Function for NF1 and NF2).
We have found that exon 4 is actually divided into 4a, 4b, and 4c; exon 10 is divided into 10a, 10b,
and 10c; and exon 12 is divided into 12a and 12b. Exon 23 is also interrupted by an intron, but
because the previous designation of the alternatively spliced insertion exon 23a was already firmly
established, we elected to designate the two exons derived from exon 23 as 23-1 and 23-2. We have
confirmed an unpublished report that exon 19 is divided into exon 19a and 19b (P. Robinson et al.,
Institute of Medical Genetics, Humboldt University, Berlin, pers. comm., 1994). Finally, exon 27 is
divided into 27a and 27b (Martin-Gallardo et al., 1992).

FIG. 2. Locations of exon boundaries in relation to the NF1-encoded amino acid sequence (initiating
methionine is 1). Downward arrowheads indicate exon boundaries, and the exon number is marked to the
right of each arrowhead at the 5' end of its respective exon. The exon boundaries between 27a and
27b and between 19a and 19b have also been determined by others (Martin-Gallardo et al., 1992;
P. Robinson, pers. comm., 1994).

NF1 exon-flanking sequences were derived from the following genomic clones (see Fig. 1): exon 1,
cosmid HB5; exons 2 through 6, cosmid c30; exons 7 through 10b, bacteriophage P1-9; exons 10c
through 23-1, cosmid c10; exons 23-2 through 27b, cosmid c23; and exons 28 through 49, cosmids
cEVI36, cEVI20, cT315, and c7D5. Oligonucleotide primers synthesized from intronic sequences were
used to PCR-amplify NF1 exons using DNA from NF1 genomic clones as template. The sequences of these
primer pairs are presented in Table 1 along with the size of each PCR product generated (shown in
parentheses under the number of the exon that it contains).


Sizes of the majority of NF1 introns are shown in Table
2. The exact lengths of all introns from exons 28 through
49 are known from sequence analysis. The lengths of
introns between exons upstream of exon 27b have been
estimated by agarose gel electrophoresis of PCR amplification
products using facing, exon-based primer pairs
from adjacent exons with cloned NF1 genomic DNA as
template. Inability to amplify by PCR indicated that the
intron was likely to be longer than 4 kb. Numerous
introns have been fully sequenced and are marked
with an asterisk in column 4 of Table 2.
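
The cDNA positions listed in Table 2 can be cross-checked against the exon sizes, because each exon should start where the previous one ends. The following is a minimal sketch of that check, not part of the original analysis: the function names are hypothetical, and only the first eight rows of Table 2 are included for brevity.

```python
# Rows copied from the start of Table 2: (exon, cDNA start position, size in bp).
# Alternatively spliced exons 23a and 48a would need separate handling, because
# each shares its start position with the constitutive exon that follows it.
TABLE2_ROWS = [
    ("1", 1, 60), ("2", 61, 144), ("3", 205, 84), ("4a", 289, 195),
    ("4b", 484, 103), ("4c", 587, 68), ("5", 655, 76), ("6", 731, 158),
]


def expected_starts(sizes, first=1):
    """Start position of each exon if exons abut with no gaps in the cDNA."""
    starts = [first]
    for size in sizes[:-1]:
        starts.append(starts[-1] + size)
    return starts


def inconsistent_rows(rows):
    """Return (exon, listed start, expected start) for rows that do not add up."""
    sizes = [size for _, _, size in rows]
    expected = expected_starts(sizes, first=rows[0][1])
    return [
        (exon, start, exp)
        for (exon, start, _), exp in zip(rows, expected)
        if start != exp
    ]


if __name__ == "__main__":
    print(inconsistent_rows(TABLE2_ROWS))  # an empty list means the rows agree
```

Running the same check over the full table is a quick way to catch transcription errors in either the position or the size column.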

Intron 10c harbors sequence from a cDNA, pHN-3,
that was isolated from a human placenta library (Suzuki
et al., 1992). We found that the divergence point
of pHN-3 with respect to NF1 cDNA lies at an exon
boundary and that the sequence of pHN-3 cDNA downstream
of this boundary is continuous genomic sequence
that extends 241 residues into intron 10c. Thus,
failure to splice exon 10c to exon 11 could lead to alternative
processing of the NF1 message, resulting in a
TABLE 1

Intron Primers Used to Amplify NF1 Exons

Exon      (bp)    Sequences (5'→3')
1         (438)   CAGACCCTCTCCTTGCCTCTT           GGATGGAGGGTCGGAGGCTG
2         (221)   TTTTAAGGATAAACTGTTTACGTG        TCCCCAAAACACAGTAACCC
3         (195)   TTTCACTTTTCAGATGTGTGTTG         AATTCCAAGCCTCCTACTTAG
4a        (415)   TTTGAAAATTTTCATAATAGAAAATGT     GAGGTCAAAGCTGCTGTGAG
4b        (339)   CTCCTGGCCTCAAGTGGTC             TTATAAAATCCAGATTGGTGTTC
4c        (283)   TTTCCTAGCAGACAACTATCGA          CATCAAAAAAAAAATTTTAATACCAG
5         (240)   TGACTTGAGTGATAGTTTCACAT         AAAAAAAAATCAATCGTATCCTTA
6         (301)   CATGTTTATCTTTTAAAAATGTTGCC      ATAATGGAAATAATTTTGCCCTCC
7         (373)   TGCTATAATATTA(X)TACATCTGG       CCTATGAACTTATCAACGAAGAG
8         (318)   TGTGCTGCTTCTGGCAACTG^a          CTAGTCTTTCTGTTTATAAAGGAT^a
9         (248)   TTTGACCTCATTTGTATTACTGAG        AGAACCTTTTGAAACCAAGAGTG
10a       (232)   ACGTAATTTTGTACTTTTTCTTCC        CAATAGAAAGGAGGTGAGATTC
10b       (267)   CTTTAAAGTGATAGCTATTACC          TCTTGGCGATTCAGCTAAACC
10c       (379)   CTTGGTACCCTTTAGCAGTCAC          CCTTCTTTCTCCATGGAG
11        (189)   GTACTCCAGTGTTATGTTTACC          TAAAGTTGAAATTTAAAAATTAAAGTAC
12a       (303)   AAACCTTACAAGAAAAACTAAGCT        ATTACCATTCCAAATATTCTTCCA
12b       (382)   TTTCTAGTGAATCTCCTTCAAGT         ATGAAATTTACCAAATTTCATTCAG
13        (382)   CACAGTTTATTGCATTGTTAGAT         GCCATGTGCTTTGAGGCAGAA
14^b      (285)   TCCTTTTGGGTGGAGCTTATC           TATACTTGTAATATGCACGTATC
15        (275)   TGTGATCAGGAATAGCTTTTGAA         TTAACAGATAAAAGTCAACTTTAC
16^b      (549)   TGGATAAAGCATAATTTGTCAAGT        TAGAGAAAGGTGAAAAATAAGAG
17        (318)   CTCTGTGTGTTTAGATCAGTCA          TTTATCAATTACTACCAGTATCAG
18        (367)   AGAAGTTGTGTACGTTCTTTTCT         CTCCTTTCTACCAATAACCGC
19a       (267)   TCATGTCACTTAGGTTATCTGG          TGTAATTAAGTAGTTATAACTCTC
19b       (276)   CCTAAAGTTTATATCTGTTAATAAG^a     TGGTGGGGGGCTTTATTTGC
20^b      (402)   CCACCCTGGCTGATTATCG             TAATTTTTGCTTCTCTTACATGC
21        (374)   AAATGAAAGTTTCATATAGAAATAC       ATTTGCTATGTGCCAGGGAC
22^b      (331)   TGCTACTCTTTAGCTTCCTAC           CCTTAAAAGAAGACAATCAGCC
23-1      (281)   TTTGTATCATTCATTTTGTGTGTA        AAAAACACGGTTCTATGTGAAAAG
23-2^b    (268)   CTTAATGTCTGTATAAGAGTCTC         ACTTTAGATTAATAATGGTAATCTC
23a       (447)   AGCCAGAAATAGTATACATGATTGGGT     CTATTTTCTGCCAGAATTAGTAGA
24^b      (266)   TTGAACTCTTTGTTTTCATGTCTT        GGAATTTAAGATAGCTAGATTATC
25^b      (338)   AATATAATAATTATATTTGGGAAGGT      GAAAATATTTGATTCAAACAGAGC
26        (342)   GCTTTGTCTAATGTCAAGTCAC          TTAAACGGAGAGTGTTCACTATC
27a       (298)   GTTACAAGTTAAAGAAATGTGTAG        CTAACAAGTGGCCTGGTGGCAAAC^c
27b       (296)   TTTATTGTTTATCCAATTATAGACTT      TCCTGTTAAGTCAACTGGGAAAAAC^c
48a       (246)   ATCTAGTATCTAATTGTATTTCACC       GCAGACTGAGCTTACAGGGAC
49.1      (327)   CTGGGAGAAACAGGCTATAC            TTTGAAACTGCCAGC(X)T
49.2      (449)   GCAATGAAATTCAGTCCTGGAAGG        CCCACTTTCTTTGCAGTTGTTCTG
49.3      (678)   AAGAAAGTGGGAGGTCAGGAA           ACTTTGTTCACTGATTACCCTGC
49.4      (776)   AAACTCAGTCATCCAAACTAG           GAAACTCCCCTCTCCCCTAG
49.5      (888)   GTAGTGCTGTTTCATTAGAGG           ACATTTTAGTTACTTGGAAAGCTAG
49.6      (717)   ACCTTGTTTAAAGAACTTCTGAC         CACAAACTATTTACAGGTGAATATAC
49.7      (707)   CAGGTTTGTAGGTTTTAGG             GGGAAGAAAACAGAGCCGATAC

Note. Oligonucleotides synthesized from intronic sequences were used in routine PCR to amplify the expected size products as noted in
parentheses beside each exon number. Exon 49 including the 3'-UTR has been amplified in seven overlapping segments.

^a Sequence derived from clones that map to NF1-homologous loci.

^b Primer pairs that yield NF1-specific sequence using uncloned genomic DNA as template.

^c Primer sequence from Martin-Gallardo et al. (1992).

shorter  mRNA  species  that  encodes  a  smaller  isoform 
of  neurofibromin. 

NF1 3'-untranslated region. The original reports of
NF1 cDNAs failed to identify the 3' end of the gene.
The majority of published NF1 cDNA sequences were
derived from a human fetal-brain library that was oligo(dT)
and random-primed for reverse transcription
into cDNA (Cawthon et al., 1990a; Xu et al., 1990a;
Marchuk et al., 1991). Other 3'-end cDNAs were isolated
from various libraries, including cauda equina
and B lymphoblasts (Wallace et al., 1990), pre-B lymphocytes
(Bernards et al., 1992), basophilic leukemia
cells (Suzuki et al., 1992), and fetal muscle, adult brain,
and endothelial cells (Marchuk et al., 1991); the longest
extended 291 bases downstream of the stop codon (Bernards
et al., 1992). None of these NF1 cDNAs contained
a poly(A) tail, multiple clones were chimeric, and no
clones terminated at the same site. Subsequently, we
isolated two cDNA clones that matched genomic sequence
downstream of the stop codon, FB12 and H4B
TABLE 2

Coding Exons of NF1

Exon No.    cDNA position    Size (bp)    Introns (kb)
1           1                60           ? (20-140)
2           61               144          3.10
3           205              84           ?
4a          289              195          ?
4b          484              103          2.0
4c          587              68           0.22*
5           655              76           0.80
6           731              158          ?
7           889              174          0.40
8           1063             123          ?
9           1186             75           4.0
10a         1261             132          ?
10b         1393             135          4.0
10c         1528             114          2.5
11          1642             80           0.54
12a         1722             124          ?
12b         1846             156          1.2
13          2002             250          0.49
14          2252             74           0.23*
15          2326             84           1.3
16          2410             441          0.38*
17          2851             140          0.28*
18          2991             123          0.46*
19a         3114             84           1.2
19b         3198             117          0.55
20          3315             182          0.12*
21          3497             212          2.2
22          3709             162          0.14*
23-1        3871             104          ?
23-2        3975             136          4.0
(23a        4111             63           6.0)
24          4111             159          0.53*
25          4270             98           1.25
26          4368             147          1.27
27a         4515             147          ?
27b         4662             111          45-50
28          4773             433          1.3
29          5206             341          2.7
30          5547             203          4.3
31          5750             194          1.55
32          5944             141          0.15
33          6085             280          0.40
34          6365             215          0.24
35          6580             62           0.15
36          6642             115          0.57
37          6757             102          1.7
38          6859             141          2.4
39          7000             127          6.0
40          7127             132          0.93
41          7259             136          2.0
42          7395             158          4.0
43          7553             123          0.35
44          7676             131          0.18
45          7807             101          1.1
46          7908             143          0.35
47          8051             47           1.4
48          8098             217          6.5
(48a        8315             54           6.7)
49          8315             153
3' Noncoding

Note. Intron lengths were determined either by PCR amplification using facing exon primers from adjacent exons or by complete sequence
analysis (asterisk). Question marks denote inability to amplify by PCR across the intron; such introns are likely to be greater than 4 kb.


(Fig. 3). FB12 was selected with a PCR probe from the
5' end of exon 49, and H4B was selected with a PCR
probe from exon 48a. FB12 ends at base 8749 (cDNA
numbering system from Marchuk et al., 1991), which
is the same termination site as the cDNA clone designated
Nalm6 (Bernards et al., 1992), and H4B terminates
at base 8926; these sites lie 291 and 474 bases,
respectively, downstream of the stop codon. Based on Northern
analysis of mRNA species that are approximately 11 and
13 kb in length (Viskochil et al., 1990; Wallace et al.,
1990; Suzuki et al., 1992), one would predict a 3'-UTR
of at least 2.5 kb; therefore, it seemed unlikely that
H4B and FB12 represented the 3' end of the gene. The
observation that the mouse nf1 cDNA is highly homologous
to human NF1 genomic DNA downstream of the
stop codon (Bernards et al., 1993) suggested that genomic
probes downstream of the stop codon could identify
cDNAs that harbor 3'-UTR sequences. A PCR probe
representing bases 86168 to 87104 in the genomic sequence
(Weiss et al., 1992) identified two independent
clones from a fetal-retina cDNA library, FR5 and FR11,
and one clone, M13, from an adult-muscle cDNA library;
furthermore, a probe from bases 88724 through
89159 (Weiss et al., 1992) identified a clone, FB21D,
from a fetal-brain cDNA library. The sequences from
the ends of the cDNA inserts matched the genomic sequence
downstream of the NF1 stop codon, and the
insert sizes were those predicted based on the uninterrupted
genomic sequence. The cDNAs that map to the
3' end of NF1 are schematically represented in Fig. 3.
Clone FR5 is the longest, covering approximately 3.45
kb; it begins with base 8508 (Marchuk et al., 1991),
53 bp downstream from the stop codon (base 8455), and

[Figure 3 schematic: genomic DNA across the intron 48a/exon 49 boundary with the TGA stop codon marked;
cDNA clones FR5 and M13 (each ending in a poly(A) tail, An), FR11, FB21D, FB12, and H4B; scale bar, 200 bp.
Library abbreviations: FR, fetal retina; M, muscle; FB, fetal brain; H, heart.]

FIG. 3. The 3'-untranslated region of NF1. Genomic DNA is represented
as a hatched bar, and the boundary between intron 48a
and exon 49 is marked. The stop codon 143 bp downstream of the
boundary is represented as a TGA above the genomic sequence. Six
cDNAs mapping to the 3'-UTR genomic sequence are denoted by
horizontal lines; clones FB12 and H4B contain upstream cDNA sequence
denoted by an angle at the intron-exon boundary.
terminates in a poly(A) tract at base 11954 in the cDNA
sequence, which corresponds to base 89095 in the genomic
sequence (Weiss et al., 1992). This clone identifies
the NF1 3'-UTR as continuous with the stop codon and
suggests that splicing does not occur in the 3'-untranslated
region.

Both cDNA clones FR5 and M13 terminate at the
same base with the addition of poly(A) tails of 18 residues
added to an identical site in the genomic sequence.
The genomic sequence spanning the polyadenylation
site, TTAAGTAAAATGTAAATTCAATCT, contains a
potential polyadenylation signal, AAGUAA, an uncommon
sequence variant that is less efficient than
AAUAAA in polyadenylation activity (Wickens, 1990).
There are two adenosine residues in the genomic sequence
at the site of poly(A) addition; therefore, 18
residues could be attached to the C of the CAAT near the
end of the sequence shown (position 11954), or 16 residues
could be attached to the second A that follows it (position 11956).
Clone FR11 extends to base 11957 (additional T residue),
but it does not contain a poly(A) tail.
clone  FB21D,  which  covers  bases  10603  through  1183o- 
Two sequence differences are notable between the 3'-UTR
cDNAs and cosmid 7D5, which served as the template
for the genomic sequence. Clones FR5, FR11, and
M13 contain a 9-bp insertion (underlined) at base
10408 shown in the following sequence: CAGIGtOGAA-
CGATGCCAAGCTGCCACC. Seven of the nine bases
represent a duplication of an immediate upstream sequence.
Clone FB21D carries a 7-bp deletion of a tandem
duplication at base 11346, underlined in the following
sequence derived from the cosmid clone 7D5:
CAGTTTTGTTTGGGTTTTTTGTTTTTT-

The possibility that these sequence
variants may represent useful polymorphisms
is currently being investigated.

The contig of NF1 cDNA clones shown in Fig. 3 indicates
that the stop codon and the 3.5-kb 3'-untranslated
region lie within the same NF1 exon in genomic
DNA. This observation establishes the 3' end of the
gene either at base 11954 or at base 11956 in the cDNA
sequence and maps the end of NF1 approximately 15
kb upstream of the telomeric NotI site.

DISCUSSION 

The cDNAs described in this report identify the 3'
end of the NF1 mRNA and establish the telomeric
boundary of the NF1 gene. NF1 spans all but approximately
15 kb of an estimated 350-kb NotI fragment;
exons 2 through 49, including the stop codon and
the 3'-UTR, lie within this single fragment. Exon 1,
which contains 5'-untranslated sequence in addition
to coding sequence, maps to an adjacent 120-kb NotI
fragment and lies immediately downstream of a functional
promoter of transcription (G. Xu and R. White,
unpublished data). The intron between exon 1 and exon
2 could be as large as 140 kb: 20 kb of genomic DNA
downstream of exon 1 (represented in cosmid HB5) plus
an estimated 115 kb of uncloned DNA (this
gap represents the portion of the estimated 350-kb NotI
fragment not accounted for in the cloned contig) plus
a maximum of 5 kb depending on the position of exon
2 within the end fragment of cosmid c30, the most 5'
genomic clone in the contig. Exons 2 through 27b are
contained in approximately 100 kb of cloned genomic
sequence, and exons 28 through 49 are contained in
approximately 52 kb of sequenced genomic DNA. Like
intron 1, intron 27b is also relatively large, approximately
45 to 50 kb, and it harbors three embedded
genes, EVI2A (Cawthon et al., 1990b), EVI2B (Cawthon
et al., 1991), and OMGP (Viskochil et al., 1991). Intron
27b also contains two variable-repeat sites, separated
by approximately 31 kb of genomic sequence, which
have been adapted by clinical laboratories for linkage
analysis (Lazaro et al., 1993; K. Ward, University of
Utah, pers. comm., 1994).

The role that the two large introns may play in NF1
expression is unknown. The genes embedded in intron
27b are likely to place some constraints on the ubiquitous
expression of NF1, although no transcriptional interference
has been demonstrated. Even though an accurate
ascertainment of the size of intron 1 will not be
possible until the cloned contig is linked to cosmid HB5,
its estimated size of 140 kb suggests that intron 1 could
harbor regulatory elements that influence NF1 expression
in some way. In view of the presence of genes
within intron 27b, the possibility that embedded genes
also lie within the relatively large intron 1 should be
considered.

Knowledge of NF1 structure enabled us to locate
other landmarks in the gene. Amino acid homology
with yeast Ira1 and Ira2 covers encoded exons 16
through 40. The homology with bovine p120GAP extends
from exon 21 through 27a. Two known NF1
translocation breakpoints, t(1;17) and t(17;22), map to
introns 27b and 31, respectively. The AK3 pseudogene
(Xu et al., 1992) maps to intron 39. Finally, the two
known alternative splice exons, 23a (Nishi et al., 1991;
Suzuki et al., 1991) and 48a (Cawthon et al., 1990a;
Gutmann et al., 1993), map to the 10-kb
intron 23-2 and the 13.2-kb intron 48, respectively.

The two NF1 cDNA clones that end in poly(A) tails
define the 3'-UTR and establish the full length of the
mRNA transcript as approximately 12 kb. There is evidence
that mouse nf1 cDNAs demonstrate alternative
polyadenylation (Bernards et al., 1993). Given that the human
NF1 3'-UTR contains other potential polyadenylation
signals (AAUAAA at positions 9919, 10000, and
10086 and AUUAAA at positions 9140 and 9979), we
evaluated all isolated cDNA clones for alternative polyadenylation.
Although the cluster of polyadenylation
signals between bases 9919 and 10086 could explain
the approximate 2-kb difference between an 11- and a
13-kb mRNA species identified by Northern blot analysis
(Suzuki et al., 1992), we have not identified human
NF1 cDNAs that end within 20 bases downstream of
those motifs. Therefore, the basis of the size difference
remains unknown.
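
The evaluation described above, locating candidate polyadenylation hexamers and asking whether any cDNA ends shortly downstream of one, can be sketched as follows. This is an illustrative sketch rather than the procedure used in the study: the function names are hypothetical, the demonstration sequence is invented, and the 20-base window is taken from the text.

```python
import re


def find_motif(seq, motif):
    """Return 1-based start positions of every (possibly overlapping) motif hit.

    DNA input is converted to the RNA alphabet so that T and U are equivalent.
    """
    seq = seq.upper().replace("T", "U")
    motif = motif.upper().replace("T", "U")
    # A zero-width lookahead lets overlapping occurrences be reported.
    return [m.start() + 1 for m in re.finditer(f"(?={re.escape(motif)})", seq)]


def ends_near_signal(end_pos, signal_positions, motif_len=6, window=20):
    """True if a cDNA 3' end lies 1..window bases downstream of a signal."""
    return any(
        0 < end_pos - (start + motif_len - 1) <= window
        for start in signal_positions
    )


if __name__ == "__main__":
    toy = "GGAATAAACCATTAAAGG"  # invented sequence, not NF1
    print(find_motif(toy, "AAUAAA"))
    print(find_motif(toy, "AUUAAA"))
```

Applied to the 3'-UTR sequence, the start positions returned by `find_motif` could be compared with the last base of each cDNA clone using `ends_near_signal`.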

Several sequence motifs identified in 3'-UTRs from
other genes are known to affect mRNA stability and
gene regulation (Jackson, 1993; Rastinejad and Blau,
1993); therefore, we speculate that the NF1 3'-UTR
may play some role in the variability of clinical expression
of NF1 by influencing NF1 mRNA levels in a stochastic
manner. However, there are few motifs present
that could provide clues toward understanding its role
in mRNA processing. The NF1 3'-UTR contains no UA-rich
mRNA instability determinants (consensus UUAUUUAU),
although several adenylation control elements
are present (ACEs; consensus UUUUUAU or
UUUUAAU) (Jackson, 1993), of which the one nearest
to the AAGUAA polyadenylation signal lies 460 bases
upstream. A search of the nucleic acid sequence databases
identified two homologies with 3'-UTRs from
other genes; one is a short GT-rich segment in the β-actin
3'-UTR, and another is a 100-bp stretch in the α-tropomyosin
3'-UTR. The functional significance of these homologies
and motifs has not been tested.

Boundaries and splice junctions have been identified
for all of the known NF1 exons. Examination of the
intronic splice sites reveals general adherence to the
GT-AG rule (Shapiro and Senapathy, 1987). The 5'
splice sites are invariant for GT, although 7 introns
lack the consensus purine residue at the third position.
Likewise, the 3' splice sites are invariant for AG, and
only introns 25 and 36 have a purine instead of a pyrimidine
in the preceding position. Notably, the 3' splice
junction between exons 10c and 11 lacks a polypyrimidine
tract immediately upstream of the fourth position;
instead, it has a stretch of 7 adenosine residues preceded
by 13 pyrimidine residues. This sequence structure
may confer a regulatory function for alternative
processing leading to the 2.9-kb mRNA isoform from
human placenta that terminates within exon 10c (Suzuki
et al., 1992).
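
The splice-site features discussed above can be checked mechanically for any intron sequence. The sketch below is illustrative only: the function names, the 15-base tract length, and the demonstration introns are assumptions made for the example, not values or sequences from the study.

```python
PURINES = set("AG")
PYRIMIDINES = set("CT")


def check_splice_sites(intron):
    """Summarise how an intron (5'->3', DNA alphabet) fits the GT-AG consensus."""
    s = intron.upper()
    if len(s) < 6:
        raise ValueError("intron too short to evaluate both splice sites")
    return {
        "donor_GT": s[:2] == "GT",                        # first two intron bases
        "donor_purine_at_3": s[2] in PURINES,             # third intron base
        "acceptor_AG": s[-2:] == "AG",                    # last two intron bases
        "acceptor_pyrimidine_before_AG": s[-3] in PYRIMIDINES,
    }


def pyrimidine_fraction(intron, tract_len=15):
    """Fraction of C/T among the bases immediately upstream of the
    fourth-from-last intron position."""
    s = intron.upper()
    if len(s) < tract_len + 4:
        raise ValueError("intron shorter than the requested tract")
    tract = s[-(tract_len + 4):-4]
    return sum(base in PYRIMIDINES for base in tract) / len(tract)


if __name__ == "__main__":
    # Invented introns: one with a pyrimidine-rich tract, one interrupted by A's.
    typical = "GTAAGTACGG" + "CTTCTTTCCTTTCTC" + "ACAG"
    a_rich = "GTAAGTACGG" + "CTTCTTTC" + "AAAAAAA" + "ACAG"
    print(check_splice_sites(typical))
    print(pyrimidine_fraction(typical), pyrimidine_fraction(a_rich))
```

A low pyrimidine fraction next to an intact AG, as in the second invented intron, is the kind of pattern the text describes for the junction between exons 10c and 11.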

Knowledge of the NF1 exon boundaries and flanking
sequence enabled us to design intron-based primer
pairs to amplify individual exons from genomic DNA
template. Using these specific primers in PCR amplifications,
we have been able to determine the exon composition
of genomic clones, to order the clones, and to
map partially the regions of overlap within the contig.
All of the known NF1 exons are contained in either P1
or cosmid clones of genomic DNA. Sequence analysis
of PCR products synthesized from the genomic clones
shown in Fig. 1 revealed exon sequences identical to
NF1 cDNA sequence. Several other cosmid and P1
clones originally identified with NF1 exon-based probes
upstream of exon 28 were amplified by PCR using several
primer pairs; however, the DNA sequences of exon
regions from these PCR products are not identical to
NF1 cDNA sequence (data not shown). These clones
map by fluorescence in situ hybridization to other chromosomes
(D. Viskochil, unpublished data), and we interpret
this to mean that they represent distant loci
related to NF1. NF1-homologous loci have been described
by others (Legius et al., 1992; Gasparini et al.,
1993; Cummings et al., 1993); the best characterized
locus maps to chromosome 15. It shares about 90%
homology with NF1 within the intronic regions where
sequences are known (Y. Li and R. Cawthon, unpublished
data). Although a high degree of sequence homology
exists between exons in the NF1 GAP-related
domain and the corresponding portion of the chromosome
15 locus, we have noted some frame-shifting deletions
in the latter. Therefore, we conclude that even if
it is transcribed, the chromosome 15 homologue does
not encode a functional GAP-related protein. The identification
of RT-PCR products where sequences are homologous,
but not identical, to NF1 exons (Cummings
et al., 1993) suggests that at least some NF1-related
loci may be transcriptionally active. We have not fully
evaluated the possibility that the chromosome 15 locus,
or the other NF1-homologous loci, is expressed either
in various tissues or in a developmentally regulated
fashion.

We have synthesized oligonucleotide primers based
on flanking intronic sequences for 56 of the 59 NF1
exons and have demonstrated PCR amplification of a
product of the expected size from genomic clones for
each of them; primer pairs for exons 37, 38, and 39
remain to be synthesized. However, amplification by
PCR of NF1-homologous loci when using oligonucleotide
primers that are derived from intronic sequence
from bona fide NF1-specific genomic clones complicates
efforts to screen genomic DNA for NF1 mutations by
the direct-sequencing approach. This approach may require
knowledge of intronic DNA sequences derived
from cloned NF1-homologous loci before primer pairs
can be designed to amplify NF1 sequence specifically
from genomic DNA. Using this information, we have
synthesized seven primer pairs (superscript b in Table
1) that yield NF1-specific sequence using uncloned genomic
DNA as template. The development of intron-based,
exon-specific pairs of primers to amplify each
NF1 exon from genomic DNA is a logical first step toward
implementation of a comprehensive, DNA-based
screening protocol for NF1 mutations.
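
One way to use sequence differences between NF1 and a homologous locus when choosing locus-specific primers is to place the primer's 3'-terminal base on a position where the two sequences differ. The sketch below illustrates that idea under stated assumptions: placing the mismatch at the 3' end is a common general strategy for discriminating closely related templates and is not described in the text as the authors' design rule, the function names are hypothetical, and the sequences are invented and assumed to be pre-aligned without gaps.

```python
def mismatch_positions(target, homolog):
    """0-based positions where two pre-aligned, equal-length sequences differ."""
    if len(target) != len(homolog):
        raise ValueError("sequences must be aligned to the same length")
    pairs = zip(target.upper(), homolog.upper())
    return [i for i, (a, b) in enumerate(pairs) if a != b]


def percent_identity(target, homolog):
    """Percentage of aligned positions that are identical."""
    diffs = mismatch_positions(target, homolog)
    return 100.0 * (len(target) - len(diffs)) / len(target)


def candidate_primers(target, homolog, length=20):
    """Forward primers copied from `target` whose 3'-terminal base
    falls on a position that differs in `homolog`."""
    primers = []
    for pos in mismatch_positions(target, homolog):
        start = pos - length + 1
        if start >= 0:  # skip differences too close to the 5' end
            primers.append((start, target[start:pos + 1].upper()))
    return primers


if __name__ == "__main__":
    target = "ACGT" * 6                           # invented "NF1" sequence
    homolog = target[:21] + "G" + target[22:]     # one substitution at index 21
    print(percent_identity(target, homolog))
    print(candidate_primers(target, homolog))
```

In practice, candidate primers would also need to be screened for melting temperature, self-complementarity, and specificity against every known homologous locus, none of which this sketch attempts.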
Using fluorescence in situ hybridization (FISH), we
have identified seven NF1-related loci, two separate
loci on chromosome 2, at bands 2q21 and 2q33-q34,
and one locus each on five other chromosomes at bands
14q11.2, 15q11.2, 18p11.2, 21q11.2-q21, and 22q11.2. Application
of PCR using NF1 primer pairs and genomic
DNA from somatic cell hybrids confirmed the above
loci, identified additional loci on chromosomes 12 and
15, and showed that the various loci do not share homology
beyond NF1 exon 27b. Sequenced PCR products
representing segments corresponding to NF1 exons
from these loci demonstrated greater than 95% sequence
identity with the NF1 locus. We used sequence
differences between bona fide NF1 and NF1-homologous
loci to strategically design primer sets to specifically
amplify 30 of 36 exons within the 5' end of the NF1
gene. These developments have facilitated mutation
analysis at the NF1 locus using genomic DNA as template.
© 1995 Academic Press, Inc.


INTRODUCTION 

Neurofibromatosis 1 is an autosomal dominant disorder
that afflicts approximately 1 in 3500 individuals
worldwide (Crowe et al., 1956; Riccardi, 1992). It manifests
in an age-related fashion and is characterized
mainly by café au lait spots, neurofibromas, and Lisch
nodules of the iris. The NF1 gene maps to chromosome
17q11.2 (Barker et al., 1987), and, even though it was
cloned in 1990 (Cawthon et al., 1990; Viskochil et al.,
1990; Wallace et al., 1990), its complete genomic organization
was reported only recently (Li et al., 1995). It
is a large and complex gene that spans approximately
335 kb of genomic DNA and encodes a protein (neurofibromin)
of 2818 amino acids (Marchuk et al., 1991).
The intron/exon boundaries of the coding region were
recently characterized, and it was established that NF1
is composed of 60 exons, inclusive of three alternatively
spliced insertion exons (Li et al., 1995; Danglot et al.,
1995). The NF1 promoter region (Hajra et al., 1994)
and the 3' untranslated region, which spans 3.5 kb of
genomic DNA (Li et al., 1995), have also only recently
been characterized. NF1 functions in part as a tumor
suppressor gene, whereby inactivating mutations lead
to loss of neurofibromin function associated with typical
NF1-related tumors as well as other malignancies
(Andersen et al., 1993; Basu et al., 1992; DeClue et al.,
1992; Johnson et al., 1993; Legius et al., 1993; Li et al.,
1992; The et al., 1993; Shannon et al., 1994; Xu et al.,
1992). This function is mediated, at least in part, by
the GAP (GTPase activating protein)-related domain
that, in vitro, stimulates hydrolysis of p21ras-GTP
(Ballester et al., 1990; Martin et al., 1990; Xu et al.,
1990a). Thus, inactivating NF1 mutations could play
a significant role in tumorigenesis via altered signal
transduction through p21ras.

Sequence data from this article have been deposited with the
EMBL/GenBank Data Libraries under Accession Nos. U35684-
U35699.

To whom correspondence should be addressed at the Division of
Medical Genetics, 413 MREB, University of Utah, Salt Lake City,
UT 84112. Telephone: (801) 581-8943. Fax: (801) 585-5241.

Reliable identification of NF1 mutations by PCR-based
screening of NF1 exons in genomic DNA would
be useful for studies of NF1 disease and for analysis of
the role of somatic NF1 mutations both in sporadic
tumorigenesis and in the variability of its clinical expression.
The large size of the gene, high mutation rate,
and the predominance of small deletions/insertions and
point mutations have complicated the detection of NF1
mutations. In addition, other loci have been observed
to coamplify with the NF1 gene in some PCR reactions
when performed with primer pairs that lie upstream
of exon 28, including segments that encode the catalytic
domain of neurofibromin. An NF1-related locus on chromosome
15 (GenBank Accession No. M84131) was reported
by Legius et al. (1992), and a different locus
on chromosome 15 that mapped to band 15q24-qter
(NFIHHS; EMBL Accession No. X72619) was detected
by Gasparini et al. (1993). NF1-homologous loci on
chromosomes 14 and 22 were reported by Marchuk et
al. (1991), and an NF1-related locus on chromosome 21
at 21q11.2 was reported by Suzuki et al. (1994). Other
NF1-related loci, reported in abstract form, have been
identified on chromosomes 2, 12, and 20 (Cummings et
al., 1993). The extent of homologies between NF1 and
the various NF1-related loci has not been elucidated.

In the process of sequencing genomic DNA clones
both to determine NF1 exon boundaries and to develop
intron-based PCR primers for individual NF1 exons,
we identified exonic sequences from some of our genomic
clones that did not match NF1 cDNA sequence. In
addition, sequence of several PCR products generated
from total human genomic DNA from non-NF1 controls
revealed genomic sequence that was clearly derived
from more than one locus. This strongly suggested the
presence of additional loci in the genome. We used the
genomic clones as probes in FISH (fluorescence in situ
hybridization) analysis to identify seven NF1-related
loci, and we used PCR of monochromosomal somatic
cell hybrids to identify two additional loci. To improve
NF1 mutation analysis using total genomic DNA as a
template, we have designed a number of oligonucleotide
primer pairs to amplify specifically from the NF1
locus on chromosome 17 by PCR. Implementation of
these primer pairs would simplify germline and somatic
mutation detection using PCR-based methods
and genomic DNA as a template.

MATERIALS  AND  METHODS 

Isolation of NF1 genomic clones. P1 bacteriophage clones containing
human genomic DNA were identified with NF1 exon probes
by a high-density screening protocol (Li et al., 1995). Pooled probes
containing sequence from exons 8 through 10c, 18 through 21, and
26 through 27b were labeled by using NF1 cDNA as template and
exon-based oligonucleotides as primers. Hybridization was carried
out under standard high-stringency conditions as previously described
(Li et al., 1995). Multiple P1 bacteriophage clones were identified,
and they were partially mapped to the NF1 locus by their
ability to serve as template for PCR under standard conditions using
facing primers encompassing specific NF1 exons. Cosmid clones carrying
human genomic DNA prepared from peripheral blood leukocytes
in the pWE15 vector (Stratagene, San Diego, CA) were isolated
by standard techniques as previously described (Li et al., 1995).

DNA sequencing. NF1-specific primers tagged with 21M13 (upstream
primer) and M13RP1 (downstream primer) sequences were
used for PCR amplification of the purified genomic DNAs from cosmid
and P1 clones. The PCR products were purified and sequenced using
a test site protocol provided by Applied Biosystems Inc. (Foster City,
CA) as previously described (Li et al., 1995). Subcloned genomic DNA
in Bluescript was sequenced by the dideoxy method as described
previously (Li et al., 1995).

FISH analysis. Metaphase chromosome preparations of lymphoblastoid
cell lines were prepared by standard techniques. FISH was
performed as previously described for cosmid probes (Xu et al., 1994)
and P1-bacteriophage probes (Albertsen et al., 1994). In brief, chromosomes
were first G-banded using trypsin/Wright stain, and representative
cells were photographed. Slides were destained in fixative
and then used for hybridizations. Genomic clones were labeled with
biotin using the BioNick kit labeling system (BRL, Gaithersburg,
MD). Approximately 250 ng of probe and 15 μg of human Cot-1 DNA
were mixed, denatured at 75°C, and preannealed at 37°C for 1-4 h
before placement on the slide. After hybridization at 37°C for 16 h,
the slides were washed in 50% formamide at 37°C and 2x SSC at
47°C for both P1 and cosmid clones. Biotin signals were detected
using fluorescein-conjugated avidin and amplified with biotinylated
goat anti-avidin, followed by a second layer of fluorescein-avidin.
Fluorescence was visualized using an Olympus BX50 epifluorescence
microscope equipped with filter sets specific for FITC. Chromosomes
were counterstained with propidium iodide, and the metaphases
previously photographed after banding were relocated and photographed
again to show the hybridization signals. FISH signals for each
metaphase were mapped to the respective chromosomal band by visual
comparison of photographs.

PCR amplification of monochromosomal somatic cell hybrids.
The DNA from human/rodent somatic cell hybrid cultures with a
reduced number of human chromosomes (Repository No. NA10868)
was obtained from Coriell Cell Repositories (Human Genetic Mutant
Cell Repositories, Camden, NJ). PCR amplification of exons 1-27b
from DNA from monochromosomal somatic cell hybrids and from
mouse and hamster, depending on the rodent background, was
performed using primer pairs previously described (Li et al., 1995). PCRs
were performed on a Perkin-Elmer Cetus thermocycler. In general,
all of the PCRs were performed for 30 cycles, in a final volume of 50
µl. The monochromosomal somatic cell hybrid DNA template used
was 50 ng in a final volume of 50 µl except for exons 8, 9, 10a, 10c,
and 23-2, where the concentration of the DNA template used was
50 ng in 25 µl. A master mix was made for 25 tubes that had the
following components at final concentrations: 1x PCR buffer
(Gibco-BRL, Grand Island, NY), 0.2 mM each dNTP, 1.5 mM MgCl2, 0.5
µM upstream and downstream primers, and water up to 25 or 50
µl. The PCR mixes were assembled on ice using a dedicated set of
pipettes, and the mixture was UV irradiated at 254 nm for 10 min
(Stratalinker). The mixture was aliquoted into microfuge tubes, and
1 unit of Taq polymerase and the appropriate DNA template were
added. The cycling conditions consisted of initial denaturation at
94°C for 5 min followed by 30 cycles of denaturation at 94°C for 1
min, annealing at 56°C for 1 min, and extension at 72°C for 1 min,
followed by a final extension at 72°C for 5 min.
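
As a worked illustration of the master mix described above, the following Python sketch computes per-reaction and 25-tube volumes from the stated final concentrations using C1 x V1 = C2 x V2. The stock concentrations (10x buffer, 10 mM each dNTP, 50 mM MgCl2, 10 µM primers) are assumptions chosen only for illustration and are not given in the text; the volumes of Taq polymerase and template are ignored.

```python
# Final concentrations are taken from the Methods text. Stock
# concentrations are hypothetical and serve only the illustration.
FINAL = {"PCR buffer (x)": 1.0, "dNTP, each (mM)": 0.2, "MgCl2 (mM)": 1.5,
         "upstream primer (uM)": 0.5, "downstream primer (uM)": 0.5}
STOCK = {"PCR buffer (x)": 10.0, "dNTP, each (mM)": 10.0, "MgCl2 (mM)": 50.0,
         "upstream primer (uM)": 10.0, "downstream primer (uM)": 10.0}


def per_reaction_volumes(reaction_ul):
    """Return microliters of each stock per reaction (C1*V1 = C2*V2)."""
    vols = {name: FINAL[name] * reaction_ul / STOCK[name] for name in FINAL}
    # Water makes up the remainder (enzyme and template volumes ignored).
    vols["water"] = reaction_ul - sum(vols.values())
    return vols


def master_mix(reaction_ul, n_tubes):
    """Scale the per-reaction volumes to a master mix for n_tubes."""
    return {name: v * n_tubes
            for name, v in per_reaction_volumes(reaction_ul).items()}


if __name__ == "__main__":
    for name, ul in master_mix(50.0, 25).items():
        print(f"{name:24s} {ul:8.1f} ul")
```

In practice a small excess (for example one or two extra reactions) is usually prepared to allow for pipetting losses; that choice is left to the reader.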


RESULTS 

Identification and Characterization of Genomic Clones

We used NF1 cDNA segments to isolate genomic
clones from cosmid and P1-bacteriophage libraries to
achieve full coverage of the NF1 locus as a contig.
Approximately 400 kb of genomic DNA is encompassed
by either cosmid or P1 clones, yet, even though all 60
NF1 exons are represented in genomic clones from this
contig (Li et al., 1995), a gap remains in intron 1. To
fill this gap and further refine the contig map in the 5'
end of NF1, a PCR-based approach was used to identify
the extent of exon representation in each of the genomic
clones. Using intron-based primer pairs flanking each
exon, we PCR-amplified genomic DNA from all of the
clones in the contig. The presence of the appropriate
size band as detected by agarose gel electrophoresis
confirmed the presence of that specific exon and
enabled us to compile the NF1 exon content of each clone.
[Figure 1 image not reproduced. Sequence reads as printed on the panels:
Total human genomic DNA: GCATTCAAANTACTTTANNCATCAAATAAAACGAACTAAA
P1-10 (15q11.2):         GCATTCAAAGTACTTTAGTCATCAAATAAAATGAACTAAA
P1-7 (2q21, 14q11.2):    GCATTCAAAATACTTTAATCATCAAATAAAACGAACTAAA]


FIG. 1. An example of sequence analysis of a PCR product flanking exon 18 using total human genomic DNA and DNA from P1 clones
P1-10 and P1-7 as template. Total human genomic DNA shows coamplification of more than one base at the positions marked by arrowheads.
The first arrowhead shows coamplification of a G and an A; P1-10 shows a G, while P1-7 shows an A. Similarly, the second arrowhead
shows coamplification of a G and an A; P1-10 shows a G, while P1-7 shows an A. The third arrowhead shows coamplification of a T
and a C; P1-10 shows a T, while P1-7 shows a C. The base marked by an asterisk shows coamplification of a T and a C, and P1-10
and P1-7 both show a C; the T comes from chromosome 22, which is not represented by either of these clones but was identified by PCR amplification
of somatic cell hybrids.
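
The comparison summarized in the Figure 1 legend can be sketched in a few lines of Python. The two clone reads below are transcribed from the figure panels and are assumed to be free of extraction errors; the function names are hypothetical. The sketch lists the positions at which the two clones differ and builds the mixed read expected if both loci coamplify, using standard IUPAC ambiguity codes (R for A/G, Y for C/T).

```python
# Reads transcribed from the Figure 1 panels (assumed to be accurate).
P1_10 = "GCATTCAAAGTACTTTAGTCATCAAATAAAATGAACTAAA"
P1_7 = "GCATTCAAAATACTTTAATCATCAAATAAAACGAACTAAA"

# IUPAC ambiguity codes for two-base mixtures.
IUPAC = {frozenset("AG"): "R", frozenset("CT"): "Y", frozenset("CG"): "S",
         frozenset("AT"): "W", frozenset("GT"): "K", frozenset("AC"): "M"}


def differences(a, b):
    """Return (0-based index, base in a, base in b) for each mismatch."""
    if len(a) != len(b):
        raise ValueError("reads must be aligned and of equal length")
    return [(i, x, y) for i, (x, y) in enumerate(zip(a, b)) if x != y]


def mixed_read(a, b):
    """Read expected when both templates coamplify in one PCR."""
    return "".join(x if x == y else IUPAC[frozenset((x, y))]
                   for x, y in zip(a, b))


if __name__ == "__main__":
    for idx, x, y in differences(P1_10, P1_7):
        print(f"position {idx + 1}: P1-10 {x} / P1-7 {y}")
    print(mixed_read(P1_10, P1_7))
```

The three mismatches found this way correspond to the three arrowhead positions in the legend; the asterisk position is not recovered because both clones carry a C there.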


We used the same primer pairs in NF1 mutation analysis
to amplify uncloned total genomic DNA template,
and sequences from some PCR products were ambiguous
even though 4% agarose gels showed only single
bands. In our efforts to optimize the sequencing protocols
from uncloned genomic DNA template, we sequenced
PCR products from the various genomic
clones, each of which represents a single allele. Exon
DNA sequence from PCR products synthesized with
intron-based primers using cos 30 and P1-9 DNA template
was identical to NF1 cDNA sequence for multiple
exons. However, using identical analysis, other cosmid
and P1 clones, namely P1-4, P1-7, P1-15, and cos A,
which were originally identified with NF1 cDNA
probes, revealed discrepancies with the NF1 cDNA
sequence. Figure 1 shows an example of sequence analysis
of a PCR product flanking exon 18 using total human
genomic DNA and DNA from P1 clones P1-10 and
P1-7 as template. Total human genomic DNA shows
coamplification of more than one base at the positions
marked by arrowheads, and both bases are represented
by the two P1 clones, which differ from the NF1 locus
at those sites. To characterize these separate loci better,
we applied the technique of fluorescence in situ
hybridization to map various NF1 cDNA-selected genomic
clones to their respective loci on metaphase chromosome
spreads.

Fluorescence  in  Situ  Hybridization 

FISH analysis was performed for two cosmids, cos A
and cos 30, and eight P1-bacteriophage clones, P1-4,
P1-7, P1-9, P1-10, P1-12, P1-15, P1-16, and P1-60.
Examples of FISH mapping studies are shown in Fig. 2
for three P1 clones, P1-10, P1-16, and P1-7. The
left-hand photo in Fig. 2A shows strong hybridization of
P1-10 to a D group chromosome, and the right-hand
photo shows a G-banding pattern of the same metaphase
spread, which enabled us to map the locus to
chromosome 15 at band 15q11.2. Twenty cells that
showed signal on both homologues were evaluated for
the number of chromatids with signal (expect four per
cell) to estimate the efficiency of hybridization, which
was 63% for P1-10. The left-hand photo of Fig. 2B
shows hybridization of P1-16 to an A group chromosome,
and the right-hand photo locates the map position
to bands q33-q34 on chromosome 2. The efficiency
of hybridization is 53% for P1-16. The FISH photo in
Fig. 2C shows weaker hybridization signal of P1-7 to
a group A chromosome and a group D chromosome, and


FIG. 2. FISH analysis on the left and G-banded chromosome analysis on the right for identical metaphase spreads. (A) FISH mapping
of P1-10 to chromosome 15q11.2. (B) FISH mapping of P1-16 to chromosome 2q33-q34. (C) FISH mapping of P1-7 to chromosomes 2q21
and 14q11.2.
TABLE 1

Chromosomal Location of Genomic Clones That Were Selected by
High-Stringency Hybridization Using NF1 cDNA Segments as Probes

Genomic clone   Chromosomal location          Method of determination   NF1 exons
Cos HB5         17q11.2                       Seq, SBA                  5'UTR to 1
P1-60           17q11.2                       FISH                      5'UTR to 1
Cos 30          17q11.2                       Seq, FISH                 2 to 6
P1-9            17q11.2                       Seq, FISH                 2 to 11
Cos 10          17q11.2                       Seq                       10c to 23-1
Cos 23          17q11.2                       Seq                       23-2 to 27b
P1-12           17q11.2                       FISH                      Intron 27b
Cos 7           15                            Seq
Cos A           18p11.2, 21p11.2              FISH
P1-16           2q33-q34                      FISH
P1-15           2q33-q34, 14q11.2, 22q11.2    FISH
P1-7            2q21, 14q11.2                 FISH
P1-4            15q11.2                       FISH
P1-10           15q11.2                       FISH

Note. Seq, sequence identity by comparison; FISH, fluorescence
in situ hybridization; SBA, Southern blot analysis. The right-hand
column shows exon representation of the NF1-specific genomic
clones.

the adjacent photo maps the FISH signals to chromosome
2 at band 2q21 and chromosome 14 at band
14q11.2. The efficiency of hybridization was estimated
to be 99% for the chromosome 2 locus and 93% for the
chromosome 14 locus for 20 metaphases evaluated. The
chromosomal location of other clones mapped by FISH
analysis is shown in Table 1. These loci have been
designated as NF1 loci (NF1-L), and they map to chromosome
2 (NF1L2 at 2q21 and NF1L3 at 2q33-q34),
chromosome 14 (NF1L4 at 14q11.2), chromosome 15 (at
15q11.2), chromosome 18 (NF1L5 at 18p11.2), chromosome
21 (at 21q11.2-q21), and chromosome 22 (NF1L6
at 22q11.2). The loci on chromosomes 15 and 21 have
been previously designated, as they have also been
reported by other authors.
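
The hybridization efficiencies quoted above are described as the fraction of chromatids carrying signal out of the four expected per cell. A minimal Python sketch of that calculation follows; the function name and the example count (50 chromatids in 20 cells) are hypothetical, since the raw counts are not reported in the text.

```python
def hybridization_efficiency(chromatids_with_signal, cells_scored,
                             chromatids_per_cell=4):
    """Percentage of expected chromatids that show FISH signal."""
    expected = cells_scored * chromatids_per_cell
    if not 0 <= chromatids_with_signal <= expected:
        raise ValueError("signal count must lie between 0 and the expected total")
    return 100.0 * chromatids_with_signal / expected


if __name__ == "__main__":
    # Hypothetical count: 50 labeled chromatids in 20 metaphases.
    print(hybridization_efficiency(50, 20))
```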

Monochromosomal Somatic Cell Hybrid PCR Analysis

NF1-homologous loci were also characterized by PCR
amplification and direct sequencing of PCR products
from monochromosomal somatic cell hybrids. Using
DNA extracted from somatic cell hybrids containing
single human chromosomes in a rodent background, we
attempted amplification from each monochromosomal
somatic cell hybrid with a combination of both
intron-based and exon-based primers corresponding to NF1
exons 1 through 27b. Exon-based primers over different
segments of the NF1 gene did not identify intronless
(processed) pseudogenes. Each intron-based,
exon-specific primer set was used to PCR amplify
monochromosomal somatic cell hybrid DNA. The exonic segments
of the NF1 gene that coamplify from both the NF1 locus
and other chromosomes are listed in Table 2, and the
respective sequence differences are also presented. In
summary, PCR amplification of somatic cell hybrids
confirmed the FISH mapping studies for chromosomes
2, 14, 15, 18, 21, and 22 and identified other
homologous loci on chromosomes 12 and 15.

TABLE 2

Compilation of NF1 Segments That Coamplify by PCR and Show Sequence
Differences between the NF1 Locus and Homologous Loci from the
Various Chromosomes

Exon    Locus                 Sequence alterations
7       chr 21                9 substitutions
8       chr 18                8 substitutions
        chr 21                4 substitutions, 3-bp del
9       chr 18                3 substitutions
        chr 21                4 substitutions
11      chr 21                2 substitutions
13      chr 2                 7 substitutions, 1-bp ins (A), 2-bp del
        chr 14                11 substitutions, 1-bp ins (T), 2-bp del
15      chr 14                4 substitutions
        chr 15                5 substitutions
16*     chr 12                14 substitutions, 1-bp del (G)
        chr 14                15 substitutions
17      chr 14, 22            Sequence not available
18      chr 14                9 substitutions
        chr 15                6 substitutions, 1-bp del
        chr 22                6 substitutions
19a     chr 14, 22            Sequence not available
19b     chr 15                3 substitutions, 4-bp ins
20      chr 15                11 substitutions
21      chr 15                14 substitutions, 1-bp del, 3-bp del
22      chr 15                Sequence not available
23-1    chr 15                3 substitutions
24      chr 15 (q11.2)        10 substitutions
        chr 15 (q24-qter)     8 substitutions
27a     chr 15                8 substitutions, 1-bp del
27b     chr 14                Sequence not available
        chr 15                12 substitutions, 3-bp del

Note. For exon 16 marked by an asterisk, alterations are reported
in comparison with NF1 cDNA position 2455 to 2745. Sequence from
each locus has been submitted to GenBank.

NF1-Related Loci on Chromosome 15

We used PCR amplification of somatic cell hybrids
and FISH analysis to demonstrate two loci on chromosome
15. We previously identified one locus by
evaluation of an SSCP variant PCR product. The
primers FLR5, 5'-GTTAGCCAGCGTTTCCCTCAG-3',
and FLR3, 5'-GATTCTAGGTGGTGGCTTTTTATC-3',
PCR amplify a 129-bp product spanning a segment of
exon 24 in the NF1-GRD that encodes the highest density
of amino acid residues completely conserved in
neurofibromin, p120GAP, and the yeast IRA proteins.
When this primer pair was used against total human
genomic DNA, a clean product of the expected size was
seen on agarose gel electrophoresis. However, when the
products were analyzed by single-stranded conformer
analysis (PCR-SSCP; Orita et al., 1989), a single complex
pattern was obtained from all individuals tested.
When an NF1 genomic clone from this region, cos 10,
was used as a template, a subset of the complex pattern
was seen. This suggested that a locus elsewhere in the
genome was coamplifying along with NF1. Direct
sequencing of PCR products prepared from each SSC
band revealed two sequences, one matching the known
NF1 sequence, the other differing by basepair substitutions
at seven sites in exon 24 (bases 4111 through
4269): at bases 4122 (G to A), 4140 (C to T), 4144 (G
to A), 4151 (T to C), 4179 (C to G), 4187 (C to A), and
4199 (C to T). This locus was mapped to chromosome
15 by performing the same PCR-SSCP analysis on a
panel of human/rodent somatic cell hybrids from BIOS
(New Haven, CT). Sublocalization within chromosome
15 was not possible using this hybrid system; however,
this sequence is identical to that of NF1-HHS
(Gasparini et al., 1993), which was localized to q24 to qter
on chromosome 15. The demonstration of an NF1-related
locus on chromosome 15 at band q11.2 by FISH, using
P1-4 and P1-10 clones as probes, predicted the presence
of a second separate locus on this chromosome. The
sequence of a PCR product for exon 24 from P1-4 differs
by three bases from the NF1-HHS sequence at bases
4117 (C/A), 4188 (C/T), and 4244 (A/G), which supports
the hypothesis that there are two loci on chromosome
15. We compared sequences from exons 20, 21, and 27a
between clone P1-4 and the NF1-related locus, and the
sequences were identical for 541 bp, which suggests
that the NF1-related locus (Legius et al., 1992) maps
to 15q11.2. However, there are significant sequence
differences between the NF1-related locus and P1-4 in
exon 24, where only 6 bases are identical. This observation
raises the possibility that these two loci are also
distinct.
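
For orientation, basic properties of the FLR5/FLR3 primer pair quoted above can be tabulated with a short Python sketch. The primer sequences are taken from the text; the melting temperature uses the Wallace rule (2°C per A/T, 4°C per G/C), which is only a rough estimate intended for short oligonucleotides and is an illustrative choice here, not a calculation reported by the authors.

```python
# Primer sequences as given in the text (5' to 3').
PRIMERS = {
    "FLR5": "GTTAGCCAGCGTTTCCCTCAG",
    "FLR3": "GATTCTAGGTGGTGGCTTTTTATC",
}


def gc_percent(seq):
    """Percentage of G and C bases in the oligonucleotide."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def wallace_tm(seq):
    """Rough melting temperature in degrees C by the Wallace rule."""
    at = sum(base in "AT" for base in seq)
    gc = sum(base in "GC" for base in seq)
    return 2 * at + 4 * gc


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(f"{name}: {len(seq)} nt, GC {gc_percent(seq):.1f}%, "
              f"Tm ~{wallace_tm(seq)} C (Wallace rule)")
```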

Development of Primer Pairs for NF1 Mutation Analysis

To evaluate the usefulness of our intron-based,
exon-specific primer pairs in NF1 mutation analysis, we
have examined PCR amplification of uncloned genomic
DNA for exons 1 through 27b, using previously
published NF1 primer sets (Li et al., 1995). Some primer
pairs coamplify from NF1 and homologous loci;
therefore, based on sequence differences, we redesigned
most primer sets to be NF1-specific (Table 3).

Primer pairs for exons 8, 9, 15, 21, 27a, and 27b do
not specifically amplify NF1 from uncloned genomic
DNA; therefore, to remain useful in mutation detection
analysis, we attempted to separate PCR products
amplified from different loci both by denaturing
polyacrylamide gel electrophoresis and SSCP. Sequence analysis
of aberrant bands revealed the presence of basepair
substitutions, deletions, and insertions among exon
and intron sequence in the homologous loci; thus, these
loci likely represent nonprocessed pseudogenes.
Radiolabeled PCR products amplified from total human
genomic DNA, DNA specific to chromosome 17, and DNA
from the homologous loci from the respective
monochromosomal somatic cell hybrids were subjected to
denaturing gel electrophoresis to detect size shifts. As
part of this analysis the absence of amplification from
the rodent background was documented for all exons
except exon 23a, where we were able to separate
human from amplified rodent DNA by SSCP analysis.
Products for exons 21, 27a, and 27b from the individual
chromosomes demonstrated fragment size variants of
1-3 bp that were easily separated on denaturing gels.
For example, as shown in Fig. 3 for 27a, the PCR
product from total human DNA amplified from both
chromosome 17 and the homologous locus on chromosome
15. The difference in migration of the PCR product
represents a basepair deletion in the chromosome 15 locus
for exon 27a, which was confirmed by sequence analysis
showing a basepair deletion of guanine at position 4451
in the NF1 cDNA sequence. In addition, we performed
single-stranded conformer analysis (Orita et al., 1989)
for the products that did not show a size shift on
denaturing gel electrophoresis (exons 9 and 15). SSCP
analysis of these products demonstrated the presence of
aberrant bands indicative of multiple basepair
substitutions, which, by DNA sequencing, showed 4 basepair
substitutions in exon 9 and 5 basepair substitutions
in exon 15. Using these techniques, NF1 mutations
represented by aberrant bands distinct from defined
homologous loci could be identified, even when primers
coamplify NF1-homologous loci.

DISCUSSION 

Even though NF1-related loci had previously been
reported for sequences in the 5' region of the NF1 gene,
we were surprised by the number of non-NF1 genomic
clones that were isolated under normal stringency
screening conditions. More surprising was the observation
that clones from different loci amplified equivalently
sized products (by 4% agarose gels) when
intron-based, NF1 exon-specific primer pairs were used in
PCR. Only when PCR products were sequenced did it
become clear that many clones represented
NF1-homologous loci.

The clones with the divergent sequences mapped to
different loci based on fluorescence in situ hybridization
analysis. Given the degree of homology at multiple
exons, including flanking intron sequences between
numerous genomic clones representing different loci, it
was surprising that the majority of clones hybridized
to specific chromosomal regions by FISH without
cross-hybridizing to other loci. Three clones specifically
hybridized to a single locus, while three other non-NF1
clones hybridized to multiple loci: cos A represents the
homologous locus on 21q11.2 by sequence identity and
cross-hybridizes to 18p11.2; P1-7, which likely
represents NF1L2 at 2q21, cross-hybridizes to 14q11.2; and
P1-15 hybridizes to 2q33-q34, 14q11.2, and 22q11.2.

FIG. 3. Denaturing gel electrophoresis of a PCR product for exon
27a from total human genomic DNA and DNA from monochromosomal
somatic cell hybrids for chromosomes 15 and 17. The product
using total human genomic DNA as a template shows coamplification
from the locus on chromosome 15 and chromosome 17. The product
from the chromosome 15 locus shows the presence of a deletion,
indicated by the shorter length of the product in comparison with
the locus on chromosome 17, which on sequencing was shown to be
a single base deletion of guanine (position 4451 in the NF1 cDNA).
The primer pair used is I27aUP and I27aRP (Li et al., 1995). There
was no amplification of rodent DNA using this primer set.


have  been  identified  by  PCR  of  somatic  cell  hybrids; 
neither  locus  is  represented  in  our  set  of  FISH-mapped 
genomic  clones. 

The most extensively characterized NF1-homologous
locus is the NF1-related locus on chromosome 15
(Legius et al., 1992). In addition, a separate locus on
chromosome 15 (NF1-HHS at 15q24-qter), which differs
from that described by Legius et al. (1992), has been
reported by Gasparini et al. (1993) based on sequence
divergence of a single PCR product amplified by NF1
primers derived from sequences in a segment
encompassing exon 24. By FISH analysis we demonstrate a
chromosome 15 locus that maps to 15q11.2 using both
P1-4 and P1-10 clones as probes. Comparison of the
NF1-homologous locus on 15q11.2, represented by
sequence from the two clones P1-4 and P1-10, to the
NF1-related locus on chromosome 15 showed complete
identity for exons 20, 21, and 27a, which supports the
localization of the NF1-related locus (Legius et al., 1992) to
15q11.2. However, the sequence from the NF1-related
locus cDNA diverges remarkably from the sequence
represented by P1-4 for exon 24. We hypothesize either
that this may represent a third locus on chromosome
15 or that the exon 24 sequence depicted from the
NF1-related locus may not represent the genomic sequence.
The sequence reported by Legius et al. (1992) is a PCR
product from cDNA synthesized by reverse transcription
of total RNA isolated from cultured human/rodent
somatic cell hybrid cells containing chromosome 15 as
the human complement. It is possible that the primary
transcript from this homologous locus is processed
differently and exon 24 is aberrantly spliced.
Furthermore, the NF1-related locus sequence contains only 12
bases of exon 24, and only 9 of those 12 are identical
with bona fide NF1 cDNA sequence. This raises the
possibility that this sequence may not truly represent
exon 24 sequence from the NF1-related locus at the
genomic level. We have also independently confirmed
the existence of homologous loci on chromosomes 14
and 22 (Marchuk et al., 1991) and chromosome 21 at
21q11.2 (Suzuki et al., 1994) by FISH and PCR of
somatic cell hybrids.

Genetic variation within pseudogenes may be an
important reservoir of mutations that arise by homologous
recombination between pseudogenes and genomic
segments encoding functional domains. Conversions
between gene and pseudogene have been previously
reported for a number of genes (Braun et al., 1990; Sorge
et al., 1990; Urabe et al., 1990; Matsuno et al., 1992;
Collier et al., 1993; He and Grabowski, 1993), all of
which represent linked loci. The only example of gene
conversion for unlinked loci was reported for the von
Willebrand factor gene on chromosome 12 and its
pseudogene on chromosome 22 (Eikenboom et al., 1994).
This raises the hypothesis that homologous recombination
involving intragenic segments between NF1 and
NF1-homologous loci could be the etiology of some NF1
disease-causing mutations (Cummings et al., 1993).
Mutation by gene conversion predicts that some NF1
mutations would correspond to sequences from
NF1-homologous loci. We have examined 21 known NF1
mutations (NNFF Mutation Analysis Consortium
Newsletter, June 1995) that lie in exons for which we have
detected NF1-homologous segments. Only one
mutation, R1513X in exon 27a, corresponds to a base change
in an NF1-homologous locus on 15q11.2; however, this
mutation occurs at a highly mutable CpG site. Thus,
this limited analysis provides no direct evidence of NF1
gene conversion.

NF1-homologous loci complicate mutation analysis
of the NF1 gene. We have detected regions homologous
to exons 7, 8, 9, 11, 13, 15, 16, 17, 18, 19a, 19b, 20, 21,
22, 23-1, 24, 27a, and 27b on chromosomes 2, 12, 14,
15, 18, 21, and 22. To design NF1-specific primers, we
sequenced intronic segments from PCR products from
both genomic clones and monochromosomal somatic
cell hybrids representing NF1-related loci and
compared them with bona fide NF1 sequence. New primers
were designed to have natural mismatches with the
homologous sequences at the 3' end of the
oligonucleotides. When only a single naturally occurring mismatch
was available, we created an additional mismatch to
increase primer specificity, whereby multiple
mismatches would lead to preferential hybridization of the
primer to the NF1 locus. Thus, we have compiled a set
of oligonucleotide primer pairs that amplify uncloned
human DNA template specifically from the NF1 locus
(Table 3); only primer pairs for exons 8, 9, 15, 21, 27a,
and 27b are nonspecific. PCR amplification using these
NF1 primer pairs has simplified the DNA-based
LONG RT-PCR OF NF1


(ORF) and identified mutations in 14 of 21 NF1
patients (67%). A similar detection rate in NF1
patients was obtained with the PTT in our own
laboratory (H. Breidenbach, unpubl.).

Because of limitations in translating large
proteins in vitro and in resolving large proteins
on polyacrylamide gels, PTT currently has an
upper limit of ~2 kb for the largest PCR product that
can be screened effectively. Therefore, the 8.5-kb
ORF of NF1 must be divided into overlapping
segments for amplification. This limitation means
that deletion mutations large enough to span the
entire region of overlap of two consecutive cDNA
segments that are the targets for amplification
will prevent amplification from the mutant
allele, and these mutations will be missed.
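
The reasoning in the preceding paragraph can be made concrete with a small Python sketch. It models each RT-PCR target as a segment bounded by its two primer sites and asks whether a given deletion would be revealed as a shortened product. All coordinates, the 20-base primer length, and the five-segment tiling are hypothetical values chosen for illustration; they are not the primer positions used in any published PTT assay.

```python
from dataclasses import dataclass

PRIMER_LEN = 20  # hypothetical primer length in bases


@dataclass(frozen=True)
class Segment:
    start: int  # first base of the forward primer (cDNA coordinate)
    end: int    # last base of the reverse primer


def overlaps(a_start, a_end, b_start, b_end):
    """True if closed intervals [a_start, a_end] and [b_start, b_end] intersect."""
    return a_start <= b_end and b_start <= a_end


def reveals_deletion(seg, del_start, del_end, primer_len=PRIMER_LEN):
    """True if the mutant allele still amplifies and yields a shorter product."""
    fwd_site = (seg.start, seg.start + primer_len - 1)
    rev_site = (seg.end - primer_len + 1, seg.end)
    if (overlaps(del_start, del_end, *fwd_site)
            or overlaps(del_start, del_end, *rev_site)):
        return False  # a primer site is lost; only the normal allele amplifies
    return seg.start < del_start and del_end < seg.end


# Hypothetical tiling of an ~8.8-kb target into 2-kb segments with 300-bp overlaps.
TILED = [Segment(1, 2000), Segment(1701, 3700), Segment(3401, 5400),
         Segment(5101, 7100), Segment(6801, 8800)]
LONG = Segment(1, 8800)  # a single long RT-PCR product

if __name__ == "__main__":
    deletion = (1650, 2050)  # spans the whole overlap of the first two segments
    print(any(reveals_deletion(s, *deletion) for s in TILED))
    print(reveals_deletion(LONG, *deletion))
```

Under these assumptions the deletion removes the reverse primer site of the first segment and the forward primer site of the second, so no tiled segment reports it, whereas the single long product does.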

Recently, modified reaction conditions that
allow PCR amplification of much longer DNA
targets were described (Barnes 1994; Cheng et al.
1994). Long RT-PCR of the entire NF1 ORF
followed by agarose gel electrophoresis, presented
here, allows identification of a great variety of
large deletions that would be missed in smaller,
overlapping RT-PCR products. Furthermore,
small insertion/deletion mutations present in
many of the patients that show apparently
normal long RT-PCR products can be detected
conveniently by digestion of the long RT-PCR
product with AseI and FspI, followed by agarose gel
electrophoresis.

RESULTS 

Figure 1 summarizes our strategy for mutation
detection in long RT-PCR products from NF1.
Total RNA from Epstein-Barr virus (EBV)-transformed
lymphoblasts or from whole blood
was the template for RT-PCR. Previous work has
shown that these cells express both the (+) 23A
and the (-) 23A forms of the NF1 transcript, but
do not express either (+) 9A or (+) 48A transcripts.
Therefore, the long RT-PCR method presented
here should amplify, from control individuals,
only NF1 (-) 9A (+) 23A (-) 48A (8754 bp) and
NF1 (-) 9A (-) 23A (-) 48A (8691 bp).

Reverse transcription is primed with a
specific antisense oligonucleotide downstream of
the NF1 stop codon. PCR from the resulting
cDNA is then performed using a pair of primers
flanking the ORF. The product of this long
RT-PCR is screened for deletions and insertions by
agarose gel electrophoresis. To allow detection of
smaller deletions and insertions and to narrow
the location of all detected mutations, the long
product is then digested with restriction
endonucleases that release a set of fragments with unique
sizes that are well separated by agarose gel
electrophoresis. The restriction fragments predicted
for an AseI-FspI double digest (using the Site
Program from the Intelligenetics Suite) appeared
suitable for this purpose and are shown in Figure 1.
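
The fragment prediction described above can be sketched in Python as a simple in silico double digest. The recognition sites and cut positions used here (AseI, AT^TAAT; FspI, TGC^GCA) are the standard ones for these enzymes; the toy sequence is invented for illustration and is not NF1 cDNA, and the sketch is not a reimplementation of the Intelligenetics program.

```python
# Recognition site and cut offset (bases from the start of the site on the top strand).
ENZYMES = {"AseI": ("ATTAAT", 2), "FspI": ("TGCGCA", 3)}


def cut_positions(seq, enzymes=ENZYMES):
    """Return sorted top-strand cut positions for all enzymes (linear DNA)."""
    cuts = set()
    for site, offset in enzymes.values():
        i = seq.find(site)
        while i != -1:
            cuts.add(i + offset)
            i = seq.find(site, i + 1)  # step by one to catch overlapping sites
    return sorted(cuts)


def fragment_sizes(seq, enzymes=ENZYMES):
    """Return fragment lengths, in 5' to 3' order, after complete digestion."""
    bounds = [0] + cut_positions(seq, enzymes) + [len(seq)]
    return [b - a for a, b in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    # Invented 42-bp test sequence with one AseI and one FspI site.
    toy = "G" * 10 + "ATTAAT" + "C" * 10 + "TGCGCA" + "A" * 10
    print(fragment_sizes(toy))
```

Because both sites are palindromic, scanning one strand is sufficient. Comparing the fragment list for a reference sequence with that for a sequence carrying a deletion or insertion shows which fragment changes size.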

To test its effectiveness, we applied this
approach to mutation detection to all of the
patients in our NF1 collection with known
deletions or insertions 30 bp or longer that fall
entirely between exons 1 and 49 of the NF1 gene.
The mutations in NF1 mRNA of the nine patients
constituting this set have not been reported
previously; their sizes and locations are shown in
Figure 1 and Table 1. Table 1 also lists the
observed and expected sizes of the mutant bands
before and after restriction digestion, and the
normal restriction fragments affected by the
mutations.

Figure 2 shows the results of gel electrophoresis
in 0.8% agarose of long NF1 RT-PCR products
prepared from the nine NF1 patients and three
controls. A band of ~8.7 kb was shared by all
samples. Because the (+) and (-) 23A forms of NF1
differ in length by only 63 bp, resolution of the
mixed RT-PCR products into two bands (8754
and 8691 bp, respectively) is not expected under
these conditions. Agarose gel electrophoresis of
long NF1 RT-PCR product prepared from whole
blood RNA of a control gave results equivalent to
those shown in the control lanes in Figure 2.
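
The expected band sizes follow from simple arithmetic on the two full-length products, as the short Python sketch below illustrates. The full-length sizes and the deletion sizes (1171 bp for exons 28-31, 2354 bp for exons 28-39) come from the text and Table 1; the function name is hypothetical.

```python
# Full-length long RT-PCR products in bp, from the text.
FULL_LENGTH = {"+23A": 8754, "-23A": 8691}


def mutant_band_sizes(change_bp):
    """Predicted mutant product sizes; negative change_bp denotes a deletion."""
    return {isoform: size + change_bp for isoform, size in FULL_LENGTH.items()}


if __name__ == "__main__":
    diff = FULL_LENGTH["+23A"] - FULL_LENGTH["-23A"]
    print(f"isoform difference: {diff} bp "
          f"({100.0 * diff / FULL_LENGTH['+23A']:.2f}% of the product)")
    print(mutant_band_sizes(-1171))  # del exons 28-31
    print(mutant_band_sizes(-2354))  # del exons 28-39
```

The 63-bp difference amounts to well under 1% of the product length, which is consistent with the two isoforms running as a single ~8.7-kb band in 0.8% agarose.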

Figure 3 shows the patient samples and a
control after double digestion with AseI and FspI
and additional electrophoresis in a 1.5% agarose
gel until the bromophenol blue had migrated
nearly to the bottom of the gel (19 cm). The six
restriction fragments expected from normal NF1
RT-PCR products were observed in all samples.
The largest two fragments, Ia and Ib, appeared to
have a size difference consistent with alternative
splicing of the 63-bp exon 23A. [All restriction
digest data presented here were obtained using
FspI and AseI. However, we now recommend that
FspI be replaced by the isoschizomer AviII (see
Methods).]

One of the difficulties of mutation detection in RNA samples from NF1 individuals arises from the fact that 80% of patients show unequal levels of mRNA from the two NF1 alleles (Hoffmeyer et al. 1995). In two-thirds of the patients tested, the abundance of mRNA from one NF1 allele was more than twice that of the other allele. That study used heterozygosity at a single-base-pair

Table 1. NF1 Patients, Their Known Mutations, and Mutant Band Sizes Determined by Agarose Gel Electrophoresis Before and After Restriction Digestion with AseI-FspI

Lane no. | Patient no. | Mutation | Size of mutant band by AGE: observed (predicted)^a | Restriction fragment affected | Size of mutant band by AGE after restriction digest: observed (predicted)^a
1 | 11387 | del exon 37 (102 bp) | — | II | 2189 (2182)
2 | 119-01 | del 30 bp from exon 9 | — | IV | 918 (921)
3 | 11389 | ins 176 bp into exon 9 | — | IV | 1126 (1127)
4 | 12359 | del exon 35 (62 bp) | — | II | 2229 (2222)
5 | 11573 | del exons 28-31 (1171 bp) | 7578 (7583, +23A; 7520, -23A) | Ia, Ib | 1840 (1836, +23A; 1773, -23A)
6 | 11360 | del exons 28-39 (2354 bp) | 6390 (6400, +23A; 6337, -23A) | Ia, Ib, II | 2880 (2937, +23A; 2874, -23A)
7 | 11344 | del exon 3 (84 bp) | — | V | 627 (628)
8 | 107-06 | del exon 20 (182 bp) | — | Ia, Ib, III | 4604 (4625, +23A; 4562, -23A)
9 | 94-04 | del 148 bp from exon 29 | — | Ia | 2865 (2859, +23A)
  |       |                         |   | Ib | 2802 (2796, -23A)
9 | 94-04 | del exon 29 (341 bp) | 8439 (8413, +23A; 8350, -23A) | Ia | 2682 (2666, +23A)
  |       |                      |                               | Ib | 2624 (2603, -23A)

^a Sizes of the observed mutant bands are given. The numbers in parentheses to the right of the observed sizes are the sizes predicted based on previous independent analyses of the mutations. (AGE) Agarose gel electrophoresis. Band sizes were determined with the FragmeNT Analysis Program (Molecular Dynamics, Inc.).
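The predicted sizes in Table 1 follow from simple arithmetic on the normal fragment lengths, which the following minimal Python sketch reproduces. Only the 3007-bp (Ia) and 1800-bp (III) fragment sizes and the 63-bp length of exon 23A are quoted in the text; the remaining normal sizes (II, IV, V) are assumptions back-calculated here from the predicted values in Table 1, and the function name is illustrative only.

```python
# Normal AseI-FspI fragment sizes (bp) for the (+)23A product.
# Assumption: Ia (3007) and III (1800) are quoted in the text; II, IV, and V
# are back-calculated from the predicted sizes in Table 1, not read off Figure 1.
NORMAL_FRAGMENTS = {"Ia": 3007, "II": 2284, "III": 1800, "IV": 951, "V": 712}
EXON_23A = 63  # bp; fragment Ib is fragment Ia without exon 23A


def predicted_sizes(fragments, change_bp):
    """Return (plus_23A, minus_23A) predicted sizes of a mutant band in bp.

    fragments: names of the normal fragments contained in the mutant band
    change_bp: net length change (negative for deletions, positive for insertions)
    minus_23A is None when the band does not include fragment Ia.
    """
    plus = sum(NORMAL_FRAGMENTS[name] for name in fragments) + change_bp
    minus = plus - EXON_23A if "Ia" in fragments else None
    return plus, minus


# The five fragments add up to the full-length (+)23A RT-PCR product.
full_length = sum(NORMAL_FRAGMENTS.values())
print(full_length)                             # 8754
print(predicted_sizes(["II"], -102))           # patient 11387: (2182, None)
print(predicted_sizes(["IV"], 176))            # patient 11389: (1127, None)
print(predicted_sizes(["Ia"], -1171))          # patient 11573: (1836, 1773)
print(predicted_sizes(["Ia", "II"], -2354))    # patient 11360: (2937, 2874)
print(predicted_sizes(["Ia", "III"], -182))    # patient 107-06: (4625, 4562)
```

When a deletion removes a restriction site, the two flanking fragments are listed together, as for patients 11360 and 107-06.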


the mutant band (Table 1) from the average size for the normal (±) 23A RT-PCR products (8723 bp). The approximate location of these deletions within the long RT-PCR product can be deduced from the sizes of the abnormal fragments generated by restriction enzyme digestion (Fig. 3). In patient 11573, deletion of a single 1.1-kb segment from the AseI-FspI restriction map in Figure 1 could produce the 1.8-kb mutant fragment observed after restriction digestion (Fig. 3, lane 5) only if the deletion falls within fragment I. Similarly, in patient 11360, a 2.3-kb deletion could produce the 2.9-kb mutant fragment observed after restriction digestion (Fig. 3, lane 6) only if the deletion takes out the AseI site at the fragment I/fragment II boundary.
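The localization reasoning above can be sketched in Python as a search over the restriction map. This is an illustration under stated assumptions, not the authors' procedure: the 5'-to-3' fragment order (V, IV, III, I, II) is inferred from the exon positions of the mutations in Table 1 rather than read from Figure 1, the sizes of fragments II, IV, and V are back-calculated from Table 1, fragment I is represented by its (+)23A length, and the 5% size tolerance and function names are hypothetical.

```python
# Assumed 5'->3' order and sizes (bp) of the normal AseI-FspI fragments.
# Only the 3007-bp and 1800-bp values are quoted in the text; the order and
# the other sizes are inferences from Table 1 (see the lead-in).
FRAGMENT_ORDER = [("V", 712), ("IV", 951), ("III", 1800), ("I", 3007), ("II", 2284)]
MEAN_FULL_LENGTH = 8723  # bp, average of the (+) and (-) 23A products


def estimate_deletion(observed_uncut_bp):
    """Estimate deletion size from the uncut mutant band size."""
    return MEAN_FULL_LENGTH - observed_uncut_bp


def candidate_locations(deletion_bp, observed_fragment_bp, tolerance=0.05):
    """Return runs of adjacent fragments that could yield the mutant fragment.

    A run of one fragment means the deletion lies inside that fragment;
    a longer run means the deletion removed the site(s) between its members.
    """
    hits = []
    for start in range(len(FRAGMENT_ORDER)):
        total = 0
        for end in range(start, len(FRAGMENT_ORDER)):
            total += FRAGMENT_ORDER[end][1]
            expected = total - deletion_bp
            if expected <= 0:
                continue
            if abs(expected - observed_fragment_bp) <= tolerance * observed_fragment_bp:
                hits.append([name for name, _ in FRAGMENT_ORDER[start:end + 1]])
    return hits


# Patient 11573: uncut mutant band 7578 bp, mutant fragment 1840 bp.
print(candidate_locations(estimate_deletion(7578), 1840))  # [['I']]
# Patient 11360: uncut mutant band 6390 bp, mutant fragment 2880 bp.
print(candidate_locations(estimate_deletion(6390), 2880))  # [['I', 'II']]
```

With the observed band sizes from Table 1, the search returns a single candidate for each patient, matching the conclusions drawn in the text.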

Previous work has shown that the mutation in patient 11573 is a deletion of 11 kb of genomic DNA removing exons 28-31, and the mutation in patient 11360 is a deletion of 40 kb of genomic DNA removing exons 28-39 (Viskochil et al. 1990; R. Weiss, GenBank accession no. L03723). In both cases, because the deleted regions are well away from the region of exon 23A, both the (+) and (-) 23A forms of NF1 mRNA are expected to be synthesized from the mutant alleles. Because of the small size of the length differences, resolution of the (+) and (-) 23A mutant bands by gel electrophoresis prior to restriction digestion was not expected. After restriction digestion, however, two mutant fragments 63 bp apart were expected from each of these two patients. Only a single mutant restriction fragment from each patient was observed (Fig. 3). It is likely that the lower mutant band from patient 11573 did not resolve from the normal 1800-bp band and that the higher mutant band from patient 11360 did not resolve from the normal 3007-bp band.

Patient  94-04 

Continued agarose gel electrophoresis of the initial, uncut long RT-PCR products until the full-length control products had migrated 8.5 cm from the origin revealed one additional abnor-

tion. Insertion/deletion mutations undetected by agarose gel electrophoresis of the uncut long NF1 RT-PCR product should be <300 bp. Therefore, in these patients the mutant fragments resulting from the AseI-FspI digestion usually will migrate in proximity to the corresponding normal restriction fragment, allowing easy approximate localization within the NF1 ORF. This prediction was upheld for five of the six patients (Fig. 3; Table 1).

Exceptions to this rule include any small mutation that removes one of the restriction sites targeted for AseI-FspI digestion, thereby generating a disproportionately large mutant restriction fragment. The approximate location of such mutations often can be deduced from the size of the resulting fragments. For example, a deletion of <300 bp from the NF1 long RT-PCR product will result in a mutant fragment of ~4.6 kb after restriction digestion (Fig. 3, lane 8) only if it removes the FspI restriction site in exon 20. Sequencing of small RT-PCR products spanning the exon 20 region of this patient, 107-06, has demonstrated that the mutant NF1 mRNA is missing exactly the sequence of exon 20.

The approximate location of a mutation also can be deduced when the loss of signal intensity from the position of one or more of the normal restriction fragments, relative to the signal intensity of the other normal fragments in the same lane, is large enough to measure. The signal decrease occurs because the mutant fragment shifts in the gel lane away from the position(s) of the normal fragment(s). Examples of this are the mutations in patients 11360 and 11344. With the unaided eye, it is clear in Figure 3 that fragment II (lane 6) and fragment V (lane 7) are diminished relative to the other normal-sized fragments in the same lanes. Less obvious signal losses from normal fragments, resulting from other mutations, were identifiable on careful quantitation with Molecular Dynamics ImageQuaNT software. However, this approach will not succeed in all cases, because of the very low abundance of some patients' mutant NF1 mRNAs.

DISCUSSION 

Long RT-PCR of the entire NF1 ORF and analysis of the products on agarose gels before and after restriction digestion with AseI and FspI is shown here to be a highly effective way to identify a great variety of NF1 mutations. In the assay reported here, for all nine patient samples analyzed, the mutant bands observed on agarose gels before and after restriction digestion were the sizes predicted based on the patients' known insertion/deletion mutations. The greatest strength of the assay is its power to yield mutant RT-PCR products from deletion mutations that would be missed by traditional methods. For example, neither of the deletions in patients 11573 and 11360 would be detected by the protein truncation assay described by Heim et al. (1995), because the binding sites are removed for both the downstream primer of the S3 segment and the upstream primer of the S4 segment.

The power of gel electrophoresis to resolve bands of similar size depends in part on the location of the bands relative to the electrophoretic front during electrophoresis (Rf = distance of band from the origin/distance of the electrophoretic front from the origin); the best resolution is at an Rf near 0.5. Because the 2% difference in length between the normal fragments Ia and Ib was resolvable at an Rf of ~0.31, it is likely that the smallest possible single exon dropout from lymphoblast NF1 mRNA, the 47-bp exon 47, would be detected also: It contributes 2.1% of the length of fragment II, which has an Rf of ~0.36 in these 1.5% agarose gels.
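The resolution argument rests on two small calculations, sketched here in Python. The fragment Ia length (3007 bp), the 63-bp exon 23A, the 47-bp exon 47, and the 19-cm front distance come from the text; the 2284-bp length of fragment II is an assumption back-calculated from the predicted sizes in Table 1, the 5.9-cm band distance is a hypothetical value chosen only to illustrate the Rf formula, and the function names are illustrative.

```python
def rf(band_distance_cm, front_distance_cm):
    """Relative mobility: band distance from origin / front distance from origin."""
    return band_distance_cm / front_distance_cm


def percent_length_difference(difference_bp, fragment_bp):
    """Length difference expressed as a percentage of the normal fragment."""
    return 100.0 * difference_bp / fragment_bp


# Exon 23A (63 bp) relative to fragment Ia (3007 bp): the resolved Ia/Ib pair.
print(round(percent_length_difference(63, 3007), 1))  # 2.1
# Exon 47 (47 bp) relative to fragment II (assumed 2284 bp, see lead-in).
print(round(percent_length_difference(47, 2284), 1))  # 2.1
# Hypothetical band 5.9 cm from the origin with the dye front at 19 cm.
print(round(rf(5.9, 19.0), 2))                        # 0.31
```

Both dropouts change the fragment length by about 2%, which is why resolving Ia from Ib suggests that loss of exon 47 from fragment II would also be visible.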

These considerations also suggest that a few samples with no apparent abnormalities by a single gel analysis of the AseI-FspI digestion products may reveal small deletions or insertions upon re-electrophoresis of the samples on two additional gels, one with a lower agarose concentration to optimize resolution of fragments Ia, Ib, II, and III, and the other with a higher agarose concentration to optimize resolution of fragments IV and V.

Occasionally, mutant restriction fragments will be missed when they are of low intensity and indistinguishable in size from one of the faint background bands appearing in all samples. Because the sizes of several of these faint bands are determined by the positions of the recognition sites for the restriction enzymes, this problem may be solved by use of a second set of restriction enzymes to cut the long NF1 RT-PCR product, in place of the AseI-FspI combination.

The assay presented here should detect ~30% of the mutations reported to date to the NF1 Mutation Consortium (Korf 1995). Currently, we are in the process of testing its detection rate in a set of unrelated NF1 patients. Expressed as mutations identified per NF1 RT-PCR product, this assay should far surpass all assays reported previously. It will be particularly useful in the detec-

Restriction Enzyme Digest and Analysis

For each RT-PCR product, the PCR buffer was exchanged with water using a Centricon-100 concentrator. Five microliters of the 8.7-kb PCR product and λ HindIII mass standards were analyzed on 1× Tris-acetate-EDTA (TAE) 0.8% agarose gels containing ethidium bromide and photographed with transillumination by UV light. Between 200 and 870 ng (~20 µl) of each sample was then digested with 10-20 units FspI (New England Biolabs) under mineral oil in a final aqueous volume of 50 µl containing 50 mM Tris-HCl (pH 8.0) and 10 mM MgCl2 at 37°C for 4 hr. The enzyme was then heat inactivated at 68°C for 20 min. Between 10 and 20 units of AseI (New England Biolabs) and additional reagents were then added to a final volume of 70 µl containing the following: 50 mM Tris-HCl (pH 8.0), 10 mM MgCl2, 100 mM NaCl, and 1 mM DTT. Incubation was then carried out at 37°C for 4 hr, followed by heat inactivation at 68°C for 20 min. After consecutive digestion with FspI and AseI, the sample was concentrated by Centricon-100 centrifugation and/or evaporative centrifugation in a Savant Speed-Vac and analyzed on a 20-cm 1× TAE 1.5% agarose gel until the bromophenol dye front had traveled nearly the length of the gel. Following electrophoresis, the gel was stained with a 1:10,000 dilution of stock SYBR Green I nucleic acid gel stain (Molecular Probes, Inc.) in 1× TAE for 15 min and scanned using a FluorImager (Molecular Dynamics) with the 530 DF30 yellow filter in place.

The data in Figure 3 were obtained using the procedure described above for double digestion of the NF1 RT-PCR product. Recently, however, we were successful in greatly simplifying the protocol for restriction digestion. AviII (Boehringer-Mannheim) is an isoschizomer of FspI that has a recommended buffer for digestion that is identical to that of AseI (New England Biolabs). We therefore replaced FspI with AviII, and found that we could digest the NF1 long RT-PCR product simultaneously with a mixture of AviII and AseI. In our current protocol, upon completing the long PCR step, 100-200 ng of PCR product (10-20 µl) was diluted directly into a final volume of 25-30 µl containing 1× AseI buffer (New England Biolabs), 4 mM spermidine, 1 mM DTT, and 10 units each of AviII and AseI. Incubation was done at 37°C for 1 hr. The digestion products were then analyzed by agarose gel electrophoresis as described above.


Estimation  of  Band  Sizes  on  Agarose  Gels 

All length determinations of gel bands were performed using the Molecular Dynamics Fragment Analysis software, version 1.1, running under Microsoft Windows NT on an Intel 486-based computer. The point-to-point logarithmic option for size analysis was applied. The size standard was the GIBCO-BRL 1-kb ladder.
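One common way to implement point-to-point logarithmic sizing is to interpolate log10(size) linearly between the two ladder bands that bracket the unknown band. The Python sketch below illustrates that general idea only; it is an assumption about what such an option does, not a description of the internal algorithm of the Molecular Dynamics software, and the function name and the example migration distances are hypothetical.

```python
import math


def size_from_migration(distance, standards):
    """Estimate a band size (bp) from its migration distance.

    standards: list of (migration_distance, size_bp) pairs for ladder bands.
    log10(size) is interpolated linearly between the two bracketing standards.
    """
    points = sorted(standards)
    for (d1, s1), (d2, s2) in zip(points, points[1:]):
        if d1 <= distance <= d2:
            fraction = (distance - d1) / (d2 - d1)
            log_size = math.log10(s1) + fraction * (math.log10(s2) - math.log10(s1))
            return 10 ** log_size
    raise ValueError("band lies outside the range of the size standards")


# Hypothetical ladder bands: 4000 bp at 2.0 cm and 1000 bp at 4.0 cm.
ladder = [(2.0, 4000), (4.0, 1000)]
# A band halfway between them is sized at the geometric mean of the two standards.
print(round(size_from_migration(3.0, ladder)))  # 2000
```

Because each interval is fitted separately, the estimate depends only on the nearest ladder bands, so closely spaced standards around the fragment of interest matter more than the overall fit of the ladder.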